kopia

mirror of https://github.com/kopia/kopia.git synced 2026-03-12 03:06:31 -04:00

Author	SHA1	Message	Date
Erkki Seppälä	6a93e4d5b9	Added support for scanning only one filesystem via files policy (#676 ) The new files policy oneFileSystem ignores files that are mounted to other filesystems similarly to tar's --one-file-system switch. For example, if this is enabled, backing up / should now automatically ignore /dev, /proc, etc, so the directory entries themselves don't appear in the backup. The value of the policy is 'false' by default. This is implemented by adding a non-windows-field Device (of type DeviceInfo, reflecting the implementation of Owner) to the Entry interface. DeviceInfo holds the dev and rdev acquired with stat (same way as with Owner), but in addition to that it also holds the same values for the parent directory. It would seem that doing this in some other way, i.e. in ReadDir, would require modifying the ReadDir interface, which seems too large a modification for a feature this small. This change introduces a duplication of 'stat' call to the files, as the Owner feature already does a separate call. I doubt the performance implications are noticeable, though with some refactoring both Owner and Device fields could be filled in in one go. Filling in the field has been placed in fs/localfs/localfs.go where entryFromChildFileInfo has acquired a third parameter giving the parent entry. From that information the Device of the parent is retrieved, to be passed off to platformSpecificDeviceInfo which does the rest of the paperwork. Other fs implementations just put in the default values. The Dev and Rdev fields returned by the 'stat' call have different sizes on different platforms, but for convenience they are internally handled the same. The conversion is done with local_fs_32bit.go and local_fs_64bit.go which are conditionally compiled on different platforms. Finally the actual check of the condition is in ignorefs.go function shouldIncludeByDevice which is analogous to the other similarly named functions.
Co-authored-by: Erkki Seppälä <flux@inside.org>	2020-10-14 22:45:32 -07:00
Jarek Kowalski	9d7cf71a37	Logging flags (#674 ) * logging: cleaned up stderr logging - do not show module - do not show timestamps by default (enable with --console-timestamps) * logging: replaced most printStderr() with log.Info * cli: additional logging cleanup	2020-10-10 10:48:37 -07:00
Jarek Kowalski	ec9c4d6095	restore: support for parallelization (#668 )	2020-10-07 21:41:32 -07:00
Jarek Kowalski	8659b45a44	cli: added --force-color and --disable-color flags (#664 ) By default Kopia will emit colored output if the output is a non-dumb terminal. You can use --force-color (or set environment variable KOPIA_FORCE_COLOR=true) to override it and emit ANSI color sequences, which is useful for example when piping through 'less'. Conversely --disable-color (or KOPIA_DISABLE_COLOR=true) will prevent color output.	2020-10-04 17:31:56 -07:00
Jarek Kowalski	6b756bad40	fshasher: truncate timestamps to full seconds when comparing to accommodate filesystems that lose precision (#661 )	2020-10-03 15:15:24 -07:00
Jarek Kowalski	f66fe5789e	Eliminated busy loop after snapshot failure (#658 ) * server: if a snapshot fails, don't start the next one for 5 minutes or until the next successful refresh. * Makefile: don't print skipped tests	2020-10-02 19:48:21 -07:00
Jarek Kowalski	044b170915	testing: fixed deadlock in faketime_test (#655 ) TestTimeAdvanceConcurrent was depending on t.Parallel() to be scheduled quickly, which is not guaranteed. Fixes #654	2020-09-29 23:28:03 -07:00
Jarek Kowalski	66cebb79cb	Fixed empty object IDs in checkpoints (#649 ) * object: fixed race condition between Result() and Checkpoint() This would sometimes result in indirect objects having empty object IDs. Fixes #648 * upload: ensure checkpoints never contain empty object IDs. * testing: reduce armhf test weight	2020-09-29 07:14:47 -07:00
Jarek Kowalski	0758a92c58	restore: improved user experience (#644 ) * restore: improved user experience * 'snapshot restore' is now the same as 'restore' and both will support restoring by manifest ID, root ID or root ID + subdirectory * added support for restoring individual files * implemented PR feedback and refactored object ID parsing Moving helpers inside the snapshot/ package helped clean up the code a lot.	2020-09-28 22:57:24 -07:00
Jarek Kowalski	fd24227379	b2: fixed handling of 'no_such_file' to indicate NOT_FOUND (#646 ) Fixes #645	2020-09-26 21:01:04 -07:00
Jarek Kowalski	ff6a414ec5	cli: When listing directory that had errors, print error summary at the end. (#643 ) Can be disabled with `--no-error-summary`. Quick demo: https://asciinema.org/a/2rma0sx2mD6HoIPy6VL0QEFeP Also refactored fs.Directory to provide Summary optionally.	2020-09-25 09:06:41 -07:00
Jarek Kowalski	c9c8d27c8d	Repro and fix for zero-sized snapshot bug (#641 ) * server: repro for zero-sized snapshot bug As described in https://kopia.discourse.group/t/kopia-0-7-0-not-backing-up-any-files-repro-needed/136/5 * server: fixed zero-sized snapshots after repository is connected via API The root cause was that source manager was inheriting HTTP call context which was immediately closed after the 'connect' RPC returned thus silently killing all uploads.	2020-09-23 20:15:36 -07:00
Jarek Kowalski	fce9497375	restore: support for symlinks (experimental) (#621 )	2020-09-18 10:29:20 -07:00
Jarek Kowalski	7cdb75ab79	fuse: changed file read implementation to avoid OOM (#620 ) Changed file read implementation from ReadAll() to a Handle to avoid OOMing We don't have automated tests for this but I verified this by restoring 13GB file over fuse and memory usage never exceeded 400MB.	2020-09-16 23:04:22 -07:00
Jarek Kowalski	f2cf71d914	logging: revamped logs from content manager to be machine parseable (#617 ) * logging: revamped logs from content manager to be machine parseable Logs from the content manager (except reads) are sent to separate log file that is always free from personally-identifiable information (e.g. no file names, just content IDs and blob IDs). Also moved CLI logs to a subdirectory (cli-logs) and put content logs in a parallel directory (content-logs) Also, the log file name will now include the type of the command that was invoked: kopia-20200913-134157-16110-snapshot-create.log Fixes #588 * tests: moved all logs from tests to a separate directory	2020-09-16 20:04:26 -07:00
Julio López	67ed3a9f96	Remove maintenance lock file on disconnect (#616 ) * Remove maintenance lock file on disconnect * Remove workaround for maintenance lock file in repotesting	2020-09-13 11:18:29 -07:00
Julio López	64b6018140	Test for directory reuse after GC (#601 ) content:Allow returning deleted content in GetContent maintenance: check deleted contents as well maintenance: test for when a directory content is reused after deletion testing: add support for repo open options in repotesting * Allow passing repo options to MustReopen * Add repotesting.Environment.MustConnectOpenAnother * Remove kopia.config.mlock file * snapshot create helper * Fix content delete related and e2e tests	2020-09-12 19:28:52 -07:00
Julio López	acc98d89b7	Trivial test nits (#602 ) * Ensure other repo is closed * Prefer testlogging.Context in tests * Prefer T.TempDir() in repotesting.Environment.Setup()	2020-09-10 17:26:03 -07:00
Julio López	70df5f738c	testing: Refactor faketime (#597 ) * Allow auto-advance in faketime.TimeAdvance * Leverage TimeAdvance in faketime.AutoAdvance * Concurrent test for faketime.AdvanceTime	2020-09-10 00:52:14 -07:00
Jarek Kowalski	3b87902433	Kopia UI improvements for repository management (#592 ) * cli: added --tls-print-server-cert flag This prints complete server certificate that is base64 and PEM-encoded. It is needed for Electron to securely connect to the server outside of the browser, since there's no way to trust certificate by fingerprint. * server: added repo/exists API * server: added ClientOptions to create and connect API * server: exposed current-user API * server: API to change description of a repository * htmlui: refactored connect/create flow This cleaned up the code a lot and made UX more obvious. * kopia-ui: simplified repository management UX Removed repository configuration window which was confusing due to the notion of 'server'. Now KopiaUI will automatically launch 'kopia server --ui' for each config found in the kopia config directory and shut it down every time repository is disconnected. See https://youtu.be/P4Ll_LR4UVM for a quick demo. Fixes #583	2020-09-07 08:00:19 -07:00
Jarek Kowalski	29ce1819cb	Added support for setting and changing repository client options (description, read-only, hostname, username) (#589 ) * repo: refactored client-specific options (hostname,username,description,readonly) into new struct that is JSON-compatible with current config * cli: added 'repository set-client' to configure parameters of connected repository * cli: cleaned up 'repository status' output	2020-09-04 13:57:15 -07:00
Jarek Kowalski	a5838ff34c	Improvements to UX for mounting directories (both CLI and KopiaUI) (#573 ) * cli: simplified mount command See https://youtu.be/1Nt_HIl-NWQ It will always use WebDAV on Windows and FUSE on Unix. Removed confusing options. New usage: $ kopia mount [--browse] Mounts all snapshots in a temporary filesystem directory (both Unix and Windows). $ kopia mount <object> [--browse] Mounts given object in a temporary filesystem directory (both Unix and Windows). $ kopia mount <object> z: [--browse] Mounts given object as a given drive letter in Windows (using temporary WebDAV mount). $ kopia mount <object> * [--browse] Mounts given object as a random drive letter in Windows. $ kopia mount <object> /mount/path [--browse] Mounts given object in given path in Unix. <object> can be the ID of a directory 'k<hash>' or 'all' Optional --browse automatically opens OS-native file browser. * htmlui: added UI for mounting directories See https://youtu.be/T-9SshVa1d8 for a quick demo. Also replaced some UI text with icons. * lint: windows-specific fix	2020-09-03 17:46:48 -07:00
Jarek Kowalski	c242235a32	blob: added SetTime() method which may be optionally implemented by blob.Storage (#575 ) cli: added --times option to 'repository sync'	2020-08-31 19:50:15 -07:00
Jarek Kowalski	1a8fcb086c	Added endurance test which tests kopia over long time scale (#558 ) Globally replaced all use of time with internal 'clock' package which provides indirection to time.Now() Added support for faking clock in Kopia via KOPIA_FAKE_CLOCK_ENDPOINT logfile: squelch annoying log message testenv: added faketimeserver which serves time over HTTP testing: added endurance test which tests kopia over long time scale This creates kopia repository and simulates usage of Kopia over multiple months (using accelerated fake time) to trigger effects that are only visible after long time passage (maintenance, compactions, expirations). The test is not used as part of any test suite yet but will run in post-submit mode only, preferably 24/7. testing: refactored internal/clock to only support injection when 'testing' build tag is present	2020-08-26 23:03:46 -07:00
Jarek Kowalski	f41e904a01	logging: changed default file log level to debug	2020-08-22 06:38:24 -07:00
Jarek Kowalski	7ae823945c	Experimental rclone backend (#545 ) This will launch 'rclone webdav server' passing random TLS certificate and username/password and serve predefined rclone remote path. This is very experimental, use with caution. Fixes #313. Additional / required changes: * blob: (experimental) support for rclone provider * server: refactored TLS utilities to separate package * webdav: add support for specifying trusted TLS certificate fingerprint * kopia-ui: added rclone support	2020-08-17 20:43:41 -07:00
Jarek Kowalski	48f253173b	kopia-ui: added ability to connect to kopia server and few other minor tweaks (#546 ) * kopia-ui: added ability to connect to kopia server * kopia-ui: update status page to show some data for repositories connected to API server * kopia-ui: hide user@host selection dropdown for kopia server repositories	2020-08-16 17:57:37 -07:00
Jarek Kowalski	27ec5c70a9	server: pre-read request body to fix HTTP/2 deadlock (#539 ) Fixes #538 (hopefully)	2020-08-15 21:53:46 -07:00
Jarek Kowalski	9a6dea898b	Linter upgrade to v1.30.0 (#526 ) * fixed godot linter errors * reformatted source with gofumpt * disabled some linters * fixed nolintlint warnings * fixed gci warnings * lint: fixed 'nestif' warnings * lint: fixed 'exhaustive' warnings * lint: fixed 'gocritic' warnings * lint: fixed 'noctx' warnings * lint: fixed 'wsl' warnings * lint: fixed 'goerr113' warnings * lint: fixed 'gosec' warnings * lint: upgraded linter to 1.30.0 * lint: more 'exhaustive' warnings Co-authored-by: Nick <nick@kasten.io>	2020-08-12 19:28:53 -07:00
Jarek Kowalski	505ab92e21	Support for repository sync (#522 ) * blob: added DisplayName() method to blob.Storage * cli: added 'kopia repo sync-to <provider>' which replicates BLOBs Usage demo: https://asciinema.org/a/352299 Fixes #509 * implemented suggestion by Ciantic to fail sync if the destination repository is not compatible with the source * cli: added 'kopia repo sync --must-exist' This ensures that target repository is not empty, otherwise syncing to an accidentally unmounted filesystem directory might copy everything again.	2020-08-09 12:36:41 -07:00
Jarek Kowalski	40acf238f3	Fixed arm and arm64 build. (#506 ) * fixed a number of cases where misaligned data was causing panics on armv7 (but not armv8) * travis: enable arm64 * test: reduce compressed data sizes when running on arm * arm: wait longer for snapshots	2020-07-30 17:31:28 -07:00
Jarek Kowalski	7e9ce61f9e	server: automatically flush the repository after setting or deleting a policy (#489 ) Fixes #479	2020-07-20 20:59:21 -07:00
Jarek Kowalski	64a6cb42dc	parallelwork: fixed error handling, which caused parallel work to never finish on any error	2020-06-24 08:39:56 -07:00
Jarek Kowalski	79757672ca	server: implemented 'flush' and 'refresh' API Added test that verifies that when client performs Flush (which happens at the end of each snapshot and when repository is closed), the server writes new blobs to the storage. Fixes #464	2020-06-07 19:38:13 -07:00
Pavan Navarathna	c13b5f820f	Remove extra whitespaces and fix minor typos (#460 )	2020-06-01 13:40:57 -07:00
Jarek Kowalski	960c33475e	maintenance: disabled automatic compaction on repository opening; instead moved to run as part of maintenance ('kopia maintenance run'); added 'kopia maintenance run --force' flag which runs maintenance even if not owned	2020-06-01 00:57:32 -07:00
Jarek Kowalski	d68273a576	Improvements for dealing with eventually-consistent stores (S3) (#437 ) * content: added support for cache of own writes This keeps track of which blobs (n and m) have been written by the local repository client, so that even if the storage listing is eventually consistent (as in S3), we get somewhat sane behavior. Note that this is still assuming read-after-create semantics, which S3 also guarantees, otherwise it's very hard to do anything useful. * compaction: support for compaction logs Instead of compaction immediately deleting source index blobs, we now write log entries (with `m` prefix) which are merged on reads and applied only if the blob list includes all inputs and outputs, in which case the inputs are discarded since they are known to have been superseded by the outputs. This addresses eventual consistency issues in stores such as S3, which don't guarantee list-after-put or list-after-delete. With such stores the repository is ultimately eventually consistent and there's not much that can be done about it, unless we use second strongly consistent storage (such as GCS) for the index only. * content: updated list cache to cache both `n` and `m` * repo: fixed cache clear on windows Clearing cache requires closing repository first, as Windows is holding the files locked. This requires ability to close the repository twice. * content: refactored index blob management into indexBlobManager * testing: fixed blobtesting.Map storage to allow overwrites * blob: added debug output String() to blob.Metadata * testing: added indexBlobManager stress test This works by using N parallel "actors", each repeatedly performing operations on indexBlobManagers all sharing single eventually consistent storage.
Each actor runs in a loop and randomly selects between: - reading all contents in indexes and verifying that it includes all contents written by the actor so far and that contents are correctly marked as deleted - creating new contents - deleting one of previously-created contents (by the same actor) - compacting all index files into one The test runs on accelerated time (every read of time moves it by 0.1 seconds) and simulates several hours of running. In case of a failure, the log should provide enough debugging information to trace the exact sequence of events leading up to the failure - each log line is prefixed with actorID and all storage access is logged. * makefile: increase test timeout * content: fixed index blob manager race The race is where if we delete compaction log too early, it may lead to previously deleted contents becoming temporarily live again to an outside observer. Added test case that reproduces the issue, verified that it fails without the fix and passes with it. * testing: improvements to TestIndexBlobManagerStress test - better logging to be able to trace the root cause in case of a failure - prevented concurrent compaction which is unsafe: The sequence: 1. A creates contentA1 in INDEX-1 2. B creates contentB1 in INDEX-2 3. A deletes contentA1 in INDEX-3 4. B does compaction, but is not seeing INDEX-3 (due to EC or simply because B started read before #3 completed), so it writes INDEX-4==merge(INDEX-1,INDEX-2) * INDEX-4 has contentA1 as active 5. A does compaction but it's not seeing INDEX-4 yet (due to EC or because read started before #4), so it drops contentA1, writes INDEX-5=merge(INDEX-1,INDEX-2,INDEX-3) * INDEX-5 does not have contentA1
6. C sees INDEX-4 and INDEX-5 and merge(INDEX-4,INDEX-5) contains contentA1 which is wrong, because contentA1 has been deleted (and there's no record of it anywhere in the system) * content: when building pack index ensure index bytes are different each time by adding 32 random bytes	2020-05-31 17:11:20 -07:00
Jarek Kowalski	8c4fb53c96	blob: support for GetMetadata() to get server-side timestamp and blob length (#440 )	2020-05-18 11:06:34 -07:00
Jarek Kowalski	d657415817	testing: added blob.Storage wrapper that simulates eventual consistency (#434 ) This is done by introducing N unsynchronized caches, which simulate what frontend of a cloud storage system might do, that causes eventual consistency behavior.	2020-05-09 12:19:32 -07:00
Jarek Kowalski	be4b897579	Support for remote repository (#427 ) Support for remote content repository where all contents and manifests are fetched over HTTP(S) instead of locally manipulating blob storage * server: implement content and manifest access APIs * apiclient: moved Kopia API client to separate package * content: exposed content.ValidatePrefix() * manifest: added JSON serialization attributes to EntryMetadata * repo: changed repo.Open() to return Repository instead of DirectRepository repo: added apiServerRepository * cli: added 'kopia repository connect server' This sets up repository connection via the API server instead of directly-manipulated storage. * server: add support for specifying a list of usernames/password via --htpasswd-file * tests: added API server repository E2E test * server: only return manifests (policies and snapshots) belonging to authenticated user	2020-05-02 21:41:49 -07:00
Jarek Kowalski	1377d057e4	Maintenance changes (#423 ) * maintenance: encrypt maintenance schedule block * maintenance: created snapshotmaintenance package that wraps maintenance and performs snapshot GC + regular maintenance in one shot, used in CLI and server * PR feedback.	2020-05-02 20:40:16 -07:00
Jarek Kowalski	4b4628a21e	Repository maintenance support (#411 ) Maintenance: support for automatic GC Moved maintenance algorithms from 'cli' to 'repo/maintenance' package Added support for CLI commands: kopia gc - performs quick maintenance kopia gc --full - performs full maintenance Full maintenance performs snapshot gc, but it's not safe to do this automatically possibly in parallel to snapshots being taken. This will be addressed ~0.7 timeframe.	2020-04-14 00:11:41 -07:00
Jarek Kowalski	1f1682b2cc	Snapshot checkpointing (#410 ) * snapshot: support for periodic checkpointing of snapshots in progress For each snapshot that takes longer than 45 minutes, we trigger internal cancellation, save the manifest and restart the snapshot at which point all files will be cached. This helps ensure the property that no file or directory objects in the repository remain unreachable from a snapshot root for more than one hour, which is important from GC perspective. * nit: unified spelling 'cancelled' => 'canceled'	2020-04-07 17:54:21 -07:00
Jarek Kowalski	057c2789d8	Kopia UI: support for multiple repositories + portability (#398 ) * server: when serving HTML UI, prefix the title with string from KOPIA_UI_TITLE_PREFIX envar * kopia-ui: support for multiple repositories + portability This is a major rewrite of the app/ codebase which changes how configuration for repositories is maintained and how it flows through the component hierarchy. Portable mode is enabled by creating 'repositories' subdirectory before launching the app. on macOS: <parent>/KopiaUI.app <parent>/repositories/ On Windows, option #1 - nested directory <parent>\KopiaUI.exe <parent>\repositories\ On Windows, option #2 - parallel directory <parent>\some-dir\KopiaUI.exe <parent>\repositories\ In portable mode, repositories will have 'cache' and 'logs' nested in it.	2020-04-04 17:18:37 -07:00
Jarek Kowalski	6cb9b8fa4f	repo: refactored public API (#318 ) * This is 99% mechanical: Extracted repo.Repository interface that only exposes high-level object and manifest management methods, but not blob nor content management. Renamed old repo.Repository to repo.DirectRepository Reviewed codebase to only depend on repo.Repository as much as possible, but added way for low-level CLI commands to use DirectRepository. * PR fixes	2020-03-26 08:04:01 -07:00
Jarek Kowalski	10bb492926	repo: deprecated NONE algorithm, will not be available for new repositories (#395 ) * repo: deprecated NONE algorithm, will not be available for new repositories Co-authored-by: Julio López <julio+gh@kasten.io>	2020-03-24 23:19:20 -07:00
Jarek Kowalski	60977812f0	Support for gather writes (#373 ) , where blob.Storage.PutBlob gets a list of slices and writes them sequentially * performance: added gather.Bytes and gather.WriteBuffer They are similar to bytes.Buffer but instead of managing a single byte slice, they maintain a list of slices, and when they run out of space they allocate a new fixed-size slice from a free list. This helps keep memory allocations completely under control regardless of the size of data written. * switch from byte slices and bytes.Buffer to gather.Bytes. This is mostly mechanical, the only cases where it's not involve blob storage providers, where we leverage the fact that we don't need to ever concatenate the slices into one and instead we can do gather writes. * PR feedback	2020-03-24 15:05:52 -07:00
Jarek Kowalski	b08d394864	policy: deduplicate multiple policies for the same source in policy manager, fixes #391	2020-03-23 23:52:23 -07:00
Jarek Kowalski	9b68a631e6	Highlight snapshot errors in the UI and CLI (#376 ) * upload: exposed numFailed and failedEntries on directory summary * cli: better present snapshot errors * htmlui: display snapshot errors	2020-03-22 14:18:47 -07:00
Jarek Kowalski	239d809075	performance: introduced buf.Pool which helps reuse memory buffers (#345 ) * performance: added buf.Pool which can be used to manage ephemeral buffers for encryption and compression * repo: switched object writer to buf.Pool * content: switched encryption to use buf.Pool * object: switched compression to use buf.Pool * testing: added missing content manager Close()	2020-03-18 20:42:16 -07:00

267 Commits