kopia

mirror of https://github.com/kopia/kopia.git synced 2026-03-13 19:57:27 -04:00

Author	SHA1	Message	Date
Jarek Kowalski	9a6dea898b	Linter upgrade to v1.30.0 (#526 ) * fixed godot linter errors * reformatted source with gofumpt * disabled some linters * fixed nolintlint warnings * fixed gci warnings * lint: fixed 'nestif' warnings * lint: fixed 'exhaustive' warnings * lint: fixed 'gocritic' warnings * lint: fixed 'noctx' warnings * lint: fixed 'wsl' warnings * lint: fixed 'goerr113' warnings * lint: fixed 'gosec' warnings * lint: upgraded linter to 1.30.0 * lint: more 'exhaustive' warnings Co-authored-by: Nick <nick@kasten.io>	2020-08-12 19:28:53 -07:00
Jarek Kowalski	984480908b	Updated documentation for v0.6.0 release (#525) * cli: small tweaks to kopia server mode * print SHA256 certificate thumbprint for auto-generated certs. * client will accept both upper- and lowercase thumbprint values * site: updated documentation for v0.6.0 release Co-authored-by: Julio López <julio+gh@kasten.io>	2020-08-10 21:11:48 -07:00
Jarek Kowalski	505ab92e21	Support for repository sync (#522 ) * blob: added DisplayName() method to blob.Storage * cli: added 'kopia repo sync-to <provider>' which replicates BLOBs Usage demo: https://asciinema.org/a/352299 Fixes #509 * implemented suggestion by Ciantic to fail sync if the destination repository is not compatible with the source * cli: added 'kopia repo sync --must-exist' This ensures that target repository is not empty, otherwise syncing to an accidentally unmounted filesystem directory might copy everything again.	2020-08-09 12:36:41 -07:00
Jarek Kowalski	04fdd0105e	Fix for policy manager not setting labels in some cases (#517) * cli: fixed 'kopia policy rm' deleting global when passed policy ID * policy: additional unit test coverage for policy manager * fixed path parsing logic to avoid the use of the filepath package, which is platform-dependent, added more tests	2020-08-06 21:25:48 -07:00
Jarek Kowalski	9de7348a75	cli: auto-upgrade the repository to 0.6.0 (#512 ) On first 'kopia snapshot' or 'kopia server' use where the 'maintenance' manifest cannot be found, we create a default one and display usage note.	2020-08-04 23:53:18 -07:00
Jarek Kowalski	fca8155283	update_check: clean up version numbers to always include 'v' prefix	2020-07-31 22:35:20 -07:00
Jarek Kowalski	8ead49b779	restore: support for zip, tar and tar.gz restore outputs (#482 ) * restore: support for zip, tar and tar.gz restore outputs Moved restore functionality to its own package. * Fix enum values in the 'mode' flag Co-authored-by: Julio López <julio+gh@kasten.io>	2020-07-22 22:56:11 -07:00
Jarek Kowalski	1c2534199c	cli: fixed panic() during snapshot verification caused by refactoring to new Repository interface	2020-06-24 08:39:56 -07:00
Jarek Kowalski	3843ed6727	cli: added content verify --include-deleted flag	2020-06-24 08:39:56 -07:00
Jarek Kowalski	25934a544d	content: fixed deletion of cleanup blobs (#462 ) * content: ensure that cleanup blobs have unique contents to prevent situation where they keep getting rewritten and thus never deleted * cli: added '--decrypt' option to 'kopia blob show'	2020-06-08 08:14:01 -07:00
Jarek Kowalski	293cb10471	cosmetic changes to maintenance behavior (#461 ) - run maintenance even if the command is about to return an error (otherwise if folks have persistent error causing snapshots to fail they will never run maintenance) - disable progress output after snapshotting so that 'kopia snapshot --all' output is clean	2020-06-01 23:38:18 -07:00
Jarek Kowalski	960c33475e	maintenance: disabled automatic compaction on repository opening; instead moved it to run as part of maintenance ('kopia maintenance run'). Added 'kopia maintenance run --force' flag, which runs maintenance even if not owned	2020-06-01 00:57:32 -07:00
Jarek Kowalski	d68273a576	Improvements for dealing with eventually-consistent stores (S3) (#437) * content: added support for cache of own writes This keeps track of which blobs (n and m) have been written by the local repository client, so that even if the storage listing is eventually consistent (as in S3), we get somewhat sane behavior. Note that this is still assuming read-after-create semantics, which S3 also guarantees, otherwise it's very hard to do anything useful. * compaction: support for compaction logs Instead of compaction immediately deleting source index blobs, we now write log entries (with `m` prefix) which are merged on reads and applied only if the blob list includes all inputs and outputs, in which case the inputs are discarded since they are known to have been superseded by the outputs. This addresses eventual consistency issues in stores such as S3, which don't guarantee list-after-put or list-after-delete. With such stores the repository is ultimately eventually consistent and there's not much that can be done about it, unless we use second strongly consistent storage (such as GCS) for the index only. * content: updated list cache to cache both `n` and `m` * repo: fixed cache clear on windows Clearing cache requires closing repository first, as Windows is holding the files locked. This requires ability to close the repository twice. * content: refactored index blob management into indexBlobManager * testing: fixed blobtesting.Map storage to allow overwrites * blob: added debug output String() to blob.Metadata * testing: added indexBlobManager stress test This works by using N parallel "actors", each repeatedly performing operations on indexBlobManagers all sharing single eventually consistent storage. Each actor runs in a loop and randomly selects between: - reading all contents in indexes and verifying that it includes all contents written by the actor so far and that contents are correctly marked as deleted - creating new contents - deleting one of previously-created contents (by the same actor) - compacting all index files into one The test runs on accelerated time (every read of time moves it by 0.1 seconds) and simulates several hours of running. In case of a failure, the log should provide enough debugging information to trace the exact sequence of events leading up to the failure - each log line is prefixed with actorID and all storage access is logged. * makefile: increase test timeout * content: fixed index blob manager race The race is where if we delete compaction log too early, it may lead to previously deleted contents becoming temporarily live again to an outside observer. Added test case that reproduces the issue, verified that it fails without the fix and passes with it. * testing: improvements to TestIndexBlobManagerStress test - better logging to be able to trace the root cause in case of a failure - prevented concurrent compaction which is unsafe: The sequence: 1. A creates contentA1 in INDEX-1 2. B creates contentB1 in INDEX-2 3. A deletes contentA1 in INDEX-3 4. B does compaction, but is not seeing INDEX-3 (due to EC or simply because B started read before #3 completed), so it writes INDEX-4==merge(INDEX-1,INDEX-2) * INDEX-4 has contentA1 as active 5. A does compaction but it's not seeing INDEX-4 yet (due to EC or because read started before #4), so it drops contentA1, writes INDEX-5=merge(INDEX-1,INDEX-2,INDEX-3) * INDEX-5 does not have contentA1 6. C sees INDEX-4 and INDEX-5, and merge(INDEX-4,INDEX-5) contains contentA1, which is wrong because contentA1 has been deleted (and there's no record of it anywhere in the system) * content: when building pack index ensure index bytes are different each time by adding 32 random bytes	2020-05-31 17:11:20 -07:00
Jarek Kowalski	9123e3ebff	cli: 'kopia server' made --ui default (#452 )	2020-05-25 19:09:26 -07:00
Julio López	e493177236	Remove legacy flags from snapshot create command (#441 ) Remove `--hostname` and `--username` flags	2020-05-18 22:48:18 -07:00
Jarek Kowalski	ca28469706	cli: improved 'snapshot delete' usage (#436 ) New usage: ``` kopia snapshot delete manifestID... [--delete] kopia snapshot delete rootObjectID... [--delete] ``` Fixes #435 cli: added --unsafe-ignore-source as alias for `--delete` This is a hidden flag for backwards compatibility. It will be removed.	2020-05-13 23:43:45 -07:00
Jarek Kowalski	be4b897579	Support for remote repository (#427 ) Support for remote content repository where all contents and manifests are fetched over HTTP(S) instead of locally manipulating blob storage * server: implement content and manifest access APIs * apiclient: moved Kopia API client to separate package * content: exposed content.ValidatePrefix() * manifest: added JSON serialization attributes to EntryMetadata * repo: changed repo.Open() to return Repository instead of DirectRepository repo: added apiServerRepository * cli: added 'kopia repository connect server' This sets up repository connection via the API server instead of directly-manipulated storage. * server: add support for specifying a list of usernames/password via --htpasswd-file * tests: added API server repository E2E test * server: only return manifests (policies and snapshots) belonging to authenticated user	2020-05-02 21:41:49 -07:00
Jarek Kowalski	1377d057e4	Maintenance changes (#423 ) * maintenance: encrypt maintenance schedule block * maintenance: created snapshotmaintenance package that wraps maintenance and performs snapshot GC + regular maintenance in one shot, used in CLI and server * PR feedback.	2020-05-02 20:40:16 -07:00
Jarek Kowalski	4f6dd94766	Maintenance: report and record maintenance run timings (#422) * mechanical rename of package snapshot/gc => snapshot/snapshotgc * maintenance: record maintenance run times and statuses Also stopped dropping deleted contents during quick maintenance, since doing this safely requires coordinating with snapshot GC which is part of full maintenance. * cli: 'maintenance info' outputs maintenance run history * maintenance: only drop index entries when it's safe to do so This is based on the timestamp of previous successful GC that's old enough to resolve all race conditions between snapshot creation and GC. * maintenance: added internal flush to RewriteContents() to better measure its time	2020-04-20 08:30:03 -07:00
Jarek Kowalski	a462798b28	content: added blob-level metadata cache (#421 ) Unlike regular cache, which caches segments of blobs on a per-content basis, metadata cache will fetch and store the entire metadata blob (q) when any of the contents in it is accessed. Given that there are relatively few metadata blobs compared to data (p) blobs, this will reduce the traffic to the underlying store and improve performance of Snapshot GC which only relies on metadata contents.	2020-04-20 02:18:43 -07:00
Jarek Kowalski	4b4628a21e	Repository maintenance support (#411) Maintenance: support for automatic GC Moved maintenance algorithms from 'cli' to 'repo/maintenance' package Added support for CLI commands: kopia gc - performs quick maintenance kopia gc --full - performs full maintenance Full maintenance performs snapshot gc, but it's not safe to do this automatically, possibly in parallel to snapshots being taken. This will be addressed in the ~0.7 timeframe.	2020-04-14 00:11:41 -07:00
Jarek Kowalski	b671bda152	cli: add missing options for configuring sftp known_hosts Possibly the cause of #414	2020-04-12 14:59:24 -07:00
Jarek Kowalski	1f1682b2cc	Snapshot checkpointing (#410 ) * snapshot: support for periodic checkpointing of snapshots in progress For each snapshot that takes longer than 45 minutes, we trigger internal cancellation, save the manifest and restart the snapshot at which point all files will be cached. This helps ensure the property that no file or directory objects in the repository remain unreachable from a snapshot root for more than one hour, which is important from GC perspective. * nit: unified spelling 'cancelled' => 'canceled'	2020-04-07 17:54:21 -07:00
Jarek Kowalski	70d4c8764a	cli: improvements to content selection for list/rewrite/stats/verify (#409 ) They now uniformly support 3 flags: --prefix=P selects contents with the specified prefix --prefixed selects contents with ANY prefix --non-prefixed selects non-prefixed contents Also changed content manager iteration API to support ranges. cli: add --prefix to 'blob gc' and 'blob stats'	2020-04-06 18:43:41 -07:00
Jarek Kowalski	057c2789d8	Kopia UI: support for multiple repositories + portability (#398 ) * server: when serving HTML UI, prefix the title with string from KOPIA_UI_TITLE_PREFIX envar * kopia-ui: support for multiple repositories + portability This is a major rewrite of the app/ codebase which changes how configuration for repositories is maintained and how it flows through the component hierarchy. Portable mode is enabled by creating 'repositories' subdirectory before launching the app. on macOS: <parent>/KopiaUI.app <parent>/repositories/ On Windows, option #1 - nested directory <parent>\KopiaUI.exe <parent>\repositories\ On Windows, option #2 - parallel directory <parent>\some-dir\KopiaUI.exe <parent>\repositories\ In portable mode, repositories will have 'cache' and 'logs' nested in it.	2020-04-04 17:18:37 -07:00
Andreas Schneider	4bed45114d	implemented b2 backend	2020-03-28 09:07:44 -07:00
Julio López	3924fb247a	Minor cleanup for snapshot time override (#392 ) * Simplify cli.parseTimestamp * nit: move duration var closer to where it is used	2020-03-27 00:19:16 -07:00
Jarek Kowalski	6cb9b8fa4f	repo: refactored public API (#318 ) * This is 99% mechanical: Extracted repo.Repository interface that only exposes high-level object and manifest management methods, but not blob nor content management. Renamed old repo.Repository to repo.DirectRepository Reviewed codebase to only depend on repo.Repository as much as possible, but added way for low-level CLI commands to use DirectRepository. * PR fixes	2020-03-26 08:04:01 -07:00
Jarek Kowalski	10bb492926	repo: deprecated NONE algorithm, will not be available for new repositories (#395 ) * repo: deprecated NONE algorithm, will not be available for new repositories Co-authored-by: Julio López <julio+gh@kasten.io>	2020-03-24 23:19:20 -07:00
Jarek Kowalski	60977812f0	Support for gather writes (#373), where blob.Storage.PutBlob gets a list of slices and writes them sequentially * performance: added gather.Bytes and gather.WriteBuffer They are similar to bytes.Buffer but instead of managing a single byte slice, they maintain a list of slices, and when they run out of space they allocate a new fixed-size slice from a free list. This helps keep memory allocations completely under control regardless of the size of data written. * switch from byte slices and bytes.Buffer to gather.Bytes. This is mostly mechanical, the only cases where it's not involve blob storage providers, where we leverage the fact that we don't need to ever concatenate the slices into one and instead we can do gather writes. * PR feedback	2020-03-24 15:05:52 -07:00
Nick	393d273e5a	Policy set: fix nil pointer dereference #385 (#387 ) * Policy set: fix nil pointer dereference #385	2020-03-23 18:32:36 -07:00
Jarek Kowalski	9b68a631e6	Highlight snapshot errors in the UI and CLI (#376 ) * upload: exposed numFailed and failedEntries on directory summary * cli: better present snapshot errors * htmlui: display snapshot errors	2020-03-22 14:18:47 -07:00
Seb Patane	6789f8e64c	cli: allow override of snapshot start time and end time	2020-03-21 09:27:32 -07:00
Onkar Bhat	a1f7a068de	Add CLI option for disabling tls verification while connecting to s3 (#370 ) * Add CLI option for disabling tls verification while connecting to s3	2020-03-19 17:21:47 -07:00
Jarek Kowalski	d930f06a27	cli: added flags to control progress output --no-progress - disables progress completely --progress-update-interval=T - controls how frequently progress is updated Fixes #344	2020-03-17 05:11:42 -07:00
Jarek Kowalski	331ff076bb	cli: added number of files that are in the process of being hashed	2020-03-14 15:56:09 -07:00
Jarek Kowalski	8d452a8285	performance: improvements to object manager (#336 ) - added pooled splitters and ability to reset them without having to recreate - added support for caller-provided compressor output to be able to pool it - added pooling of compressor instances, since those are costly	2020-03-13 08:56:18 -07:00
Jarek Kowalski	526445a9c8	CLI: pre-allocated buffers for crypto benchmark (#343 ) non-optimized (0.5.0) 0. BLAKE2B-256-128 AES256-GCM-HMAC-SHA256 644.9 MiB / second before this change: 0. BLAKE2B-256-128 AES256-GCM-HMAC-SHA256 655.9 MiB / second after (this change): 0. BLAKE2B-256-128 AES256-GCM-HMAC-SHA256 781.5 MiB / second	2020-03-12 12:11:20 -07:00
Jarek Kowalski	e80f5536c3	performance: plumbed through output buffer to encryption and hashing,… (#333 ) * performance: plumbed through output buffer to encryption and hashing, so that the caller can pre-allocate/reuse it * testing: fixed how we do comparison of byte slices to account for possible nils, which can be returned from encryption	2020-03-12 08:27:44 -07:00
Julio López	89c0c6bac4	Refactor CLI stats (#341 ) * Helper package internal/stats * Use internal/stats for blob gc stats * Use internal/stats for content list stats * Refactor gc stats - Leverages internal/stats package - Return GC stats - nit: error message formatting - Refactor block in gc.Run. Simplifies and reduces a level of indentation	2020-03-11 22:16:07 -07:00
Jarek Kowalski	6ce8410a29	Added OpenCensus (#339 ) * repo: added some initial metrics using OpenCensus * cli: added flags to expose Prometheus metrics on a local endpoint `--metrics-listen-addr=localhost:X` exposes prometheus metrics on http://localhost:X/metrics Also, kopia server will automatically expose /metrics endpoint on the same port it runs as, without authentication.	2020-03-11 22:07:31 -07:00
Julio López	edc87fcce8	Refactor content stats (#340 ) * Remove unused fields from content.Stats * Refactor content.Stats	2020-03-11 21:47:05 -07:00
Jarek Kowalski	514df69afa	performance: added wrapper around io.Copy(); this pools copy buffers so they can be reused instead of being thrown away after each io.Copy()	2020-03-10 21:52:30 -07:00
Jarek Kowalski	cd35e3bab5	cli: 'snapshot migrate' improvements to help with data migration - cleaned up migration progress output - fixed migration idempotency - added migration of policies - renamed --parallelism to --parallel - improved e2e test - do not prompt for password to source repository if persisted	2020-03-07 21:47:32 -08:00
Jarek Kowalski	d526843124	cli: fixed metadata cache size on connect/create	2020-03-07 21:47:32 -08:00
Jarek Kowalski	c5cf95fdf6	cli: improved 'content verify' Now you can quickly verify that all contents are correctly backed by existing blob without downloading much. You can still use '--full' to cause full download and decryption.	2020-03-05 17:33:56 -08:00
Jarek Kowalski	889f2ead59	cli: improved 'blob gc' - do not remove blobs younger than 4h when performing blob GC, because they may have just been written - parallelize deletes - clean up console output	2020-03-05 17:33:56 -08:00
Jarek Kowalski	6f68b726a7	cli: improved 'content rewrite' - removed confusing '--prefixed' option - print timestamp of contents as they are rewritten	2020-03-05 17:33:56 -08:00
Jarek Kowalski	a4fad4ca5a	cli: added 'blob stats' command	2020-03-05 17:33:56 -08:00
Jarek Kowalski	fb181257bf	cli: implemented update check, fixes #119	2020-03-04 22:06:05 -08:00
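
The unsafe concurrent-compaction sequence described in commit d68273a576 can be replayed with a small model. This is only an illustrative sketch, not kopia's implementation: the `Entry` type, the "newest timestamp wins" merge rule, and the helpers `merge`, `compact`, and `live_contents` are all assumptions made for the example. It shows how two compactions that cannot see each other's output bring the deleted contentA1 back to life, and how a compaction that does see the earlier output avoids it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """Hypothetical index entry: a tombstone flag plus a logical timestamp."""
    deleted: bool
    timestamp: int


def merge(*indexes):
    """Merge indexes; for each content ID the newest entry wins (assumed rule)."""
    merged = {}
    for index in indexes:
        for content_id, entry in index.items():
            current = merged.get(content_id)
            if current is None or entry.timestamp > current.timestamp:
                merged[content_id] = entry
    return merged


def compact(*indexes):
    """Merge the visible indexes into one and drop deleted entries."""
    return {cid: e for cid, e in merge(*indexes).items() if not e.deleted}


def live_contents(index):
    """Return the sorted IDs of contents that are not marked deleted."""
    return sorted(cid for cid, e in index.items() if not e.deleted)


def simulate_race():
    """Steps 1-6 of the sequence: A and B compact without seeing each other."""
    index1 = {"contentA1": Entry(deleted=False, timestamp=1)}  # A creates
    index2 = {"contentB1": Entry(deleted=False, timestamp=2)}  # B creates
    index3 = {"contentA1": Entry(deleted=True, timestamp=3)}   # A deletes
    index4 = compact(index1, index2)          # B compacts, INDEX-3 not visible
    index5 = compact(index1, index2, index3)  # A compacts, INDEX-4 not visible
    return live_contents(merge(index4, index5))  # what observer C sees


def simulate_sequential():
    """Same history, but the second compaction sees the first one's output."""
    index1 = {"contentA1": Entry(deleted=False, timestamp=1)}
    index2 = {"contentB1": Entry(deleted=False, timestamp=2)}
    index3 = {"contentA1": Entry(deleted=True, timestamp=3)}
    index4 = compact(index1, index2)
    index5 = compact(index4, index3)
    return live_contents(index5)


if __name__ == "__main__":
    print("concurrent compaction:", simulate_race())
    print("sequential compaction:", simulate_sequential())
```

In this model the concurrent run reports contentA1 as live because INDEX-5 dropped the tombstone while INDEX-4 still carries the original entry, which matches the failure the commit message describes.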

363 Commits