kopia

mirror of https://github.com/kopia/kopia.git synced 2026-01-04 04:27:53 -05:00

Author	SHA1	Message	Date
Julio Lopez	9a9048c121	breaking(cli): remove default behavior for CLI command (#2861 ) * breaking(cli): remove default behavior for `snapshot` command command: snapshot default-subcommand: create * breaking(cli): remove default behavior for `cache` command command: cache default-subcommand: info * breaking(cli): remove default behavior for `index` command command: index default-subcommand: list * breaking(cli): remove default behavior for `maintenance` command command: maintenance default-subcommand: run * breaking(cli): remove default behavior for `manifest` command command: manifest default-subcommand: list * breaking(cli): remove default behavior for `repository upgrade` command command: repository upgrade default-subcommand: begin * breaking(cli): remove default behavior for `server` command command: server default-subcommand: start	2023-04-09 01:34:36 +00:00
Jarek Kowalski	62ad437bb6	Implemented epoch-based index manager (#1174 ) * epoch: misc fixes and logging * blob: misc helpers * cli: removed useless 'repository upgrade', replaced by 'repository set-parameters' * content: implemented indexBlobManagerV1 which uses epoch manager * cli: commands to manipulate repository epoch parameters * cli: commands to examine epoch-based indexes * content: added test suite that uses epoch-based index manager * content: fixed a ton of test data races caused by sharing blobtesting.DataMap * cli: additional tests and validation for 'repository set-params' * testing: replaced the use of suite with our own, since suite is not parallelizable	2021-07-06 21:38:08 -07:00
Jarek Kowalski	8b0296cdf2	Misc index blob manager refactorings (#1138 ) * content: extracted encryptedBlobMgr component * content: renamed files * content: refactored ParseIndexBlob * epoch: adjusted API to return blob.Metadata * content: removed IndexBlobReader interface * content: cleaned up indexBlobManager API	2021-06-13 18:52:49 -07:00
Jarek Kowalski	a461d767f7	cli: plumbed through 'textOutput' which controls stdout/stderr writers (#1053 ) This is mostly for testability.	2021-05-06 20:26:35 -07:00
Jarek Kowalski	d2288c443f	cli: major refactoring (#1046 ) cli: major refactoring of how CLI commands are registered The goal is to eliminate flags as global variables to allow for better testing. Each command and subcommand and most sets of flags are now their own struct with 'setup()' methods that attached the flags or subcommand to the provided parent. This change is 94.3% mechanical, but is fully organic and hand-made. * introduced cli.appServices interface which provides the environment in which commands run * remove auto-maintenance global flag * removed globals in memory_tracking.go * removed globals from cli_progress.go * removed globals from the update_check.go * moved configPath into TheApp * removed remaining globals from config.go * refactored logfile to get rid of global variables * removed 'app' global variable * linter fixes * fixed password_.go build fixed BSD build	2021-05-03 10:28:00 -07:00
Jarek Kowalski	74833cefcb	cli: added standard --json flags to several commands (#910 ) * cli: added standard --json flags to several commands Fixes #272 * Update flag description Co-authored-by: Julio López <julio+gh@kasten.io>	2021-03-25 17:55:18 -07:00
Jarek Kowalski	fa7976599c	repo: refactored repository interfaces (#780 ) - `repo.Repository` is now read-only and only has methods that can be supported over kopia server - `repo.RepositoryWriter` has read-write methods that can be supported over kopia server - `repo.DirectRepository` is read-only and contains all methods of `repo.Repository` plus some low-level methods for data inspection - `repo.DirectRepositoryWriter` contains write methods for `repo.DirectRepository` - `repo.Reader` removed and merged with `repo.Repository` - `repo.Writer` became `repo.RepositoryWriter` - `repo.DirectRepository` struct became `repo.DirectRepository` interface Getting `{Direct}RepositoryWriter` requires using `NewWriter()` or `NewDirectWriter()` on a read-only repository and multiple simultaneous writers are supported at the same time, each writing to their own indexes and pack blobs. `repo.Open` returns `repo.Repository` (which is also `repo.RepositoryWriter`). content: removed implicit flush on content manager close * repo: added tests for WriteSession() and implicit flush behavior * invalidate manifest manager after write session * cli: disable maintenance in 'kopia server start' Server will close the repository before completing. * repo: unconditionally close RepositoryWriter in {Direct,}WriteSession * repo: added panic in case somebody tries to create RepositoryWriter after closing repository - used atomic to manage SharedManager.closed * removed stale example * linter: fixed spurious failures Co-authored-by: Julio López <julio+gh@kasten.io>	2021-01-20 11:41:47 -08:00
Jarek Kowalski	d3a6421213	cli: make sure we don't run maintenance as part of read-only actions (#769 ) * cli: make sure we don't run maintenance as part of read-only actions * cli: classified some actions that use *repo.DirectRepository as read-only too	2021-01-05 21:55:02 -08:00
Jarek Kowalski	e03971fc59	Upgraded linter to v1.33.0 (#734 ) * linter: upgraded to 1.33, disabled some linters * lint: fixed 'errorlint' errors This ensures that all error comparisons use errors.Is() or errors.As(). We will be wrapping more errors going forward so it's important that error checks are not strict everywhere. Verified that there are no exceptions for errorlint linter which guarantees that. * lint: fixed or suppressed wrapcheck errors * lint: nolintlint and misc cleanups Co-authored-by: Julio López <julio+gh@kasten.io>	2020-12-21 22:39:22 -08:00
Jarek Kowalski	d68273a576	Improvements for dealing with eventually-consistent stores (S3) (#437 ) * content: added support for cache of own writes Thi keeps track of which blobs (n and m) have been written by the local repository client, so that even if the storage listing is eventually consistent (as in S3), we get somewhat sane behavior. Note that this is still assumming read-after-create semantics, which S3 also guarantees, otherwise it's very hard to do anything useful. * compaction: support for compaction logs Instead of compaction immediately deleting source index blobs, we now write log entries (with `m` prefix) which are merged on reads and applied only if the blob list includes all inputs and outputs, in which case the inputs are discarded since they are known to have been superseded by the outputs. This addresses eventual consistency issues in stores such as S3, which don't guarantee list-after-put or list-after-delete. With such stores the repository is ultimately eventually consistent and there's not much that can be done about it, unless we use second strongly consistent storage (such as GCS) for the index only. * content: updated list cache to cache both `n` and `m` * repo: fixed cache clear on windows Clearing cache requires closing repository first, as Windows is holding the files locked. This requires ability to close the repository twice. * content: refactored index blob management into indexBlobManager * testing: fixed blobtesting.Map storage to allow overwrites * blob: added debug output String() to blob.Metadata * testing: added indexBlobManager stress test This works by using N parallel "actors", each repeatedly performing operations on indexBlobManagers all sharing single eventually consistent storage. Each actor runs in a loop and randomly selects between: - reading all contents in indexes and verifying that it includes all contents written by the actor so far and that contents are correctly marked as deleted - creating new contents - deleting one of previously-created contents (by the same actor) - compacting all index files into one The test runs on accelerated time (every read of time moves it by 0.1 seconds) and simulates several hours of running. In case of a failure, the log should provide enough debugging information to trace the exact sequence of events leading up to the failure - each log line is prefixed with actorID and all storage access is logged. * makefile: increase test timeout * content: fixed index blob manager race The race is where if we delete compaction log too early, it may lead to previously deleted contents becoming temporarily live again to an outside observer. Added test case that reproduces the issue, verified that it fails without the fix and passed with one. * testing: improvements to TestIndexBlobManagerStress test - better logging to be able to trace the root cause in case of a failure - prevented concurrent compaction which is unsafe: The sequence: 1. A creates contentA1 in INDEX-1 2. B creates contentB1 in INDEX-2 3. A deletes contentA1 in INDEX-3 4. B does compaction, but is not seeing INDEX-3 (due to EC or simply because B started read before #3 completed), so it writes INDEX-4==merge(INDEX-1,INDEX-2) * INDEX-4 has contentA1 as active 5. A does compaction but it's not seeing INDEX-4 yet (due to EC or because read started before #4), so it drops contentA1, writes INDEX-5=merge(INDEX-1,INDEX-2,INDEX-3) * INDEX-5 does not have contentA1 7. C sees INDEX-5 and INDEX-5 and merge(INDEX-4,INDEX-5) contains contentA1 which is wrong, because A has been deleted (and there's no record of it anywhere in the system) * content: when building pack index ensure index bytes are different each time by adding 32 random bytes	2020-05-31 17:11:20 -07:00
Jarek Kowalski	6cb9b8fa4f	repo: refactored public API (#318 ) * This is 99% mechanical: Extracted repo.Repository interface that only exposes high-level object and manifest management methods, but not blob nor content management. Renamed old repo.Repository to repo.DirectRepository Reviewed codebase to only depend on repo.Repository as much as possible, but added way for low-level CLI commands to use DirectRepository. * PR fixes	2020-03-26 08:04:01 -07:00
Jarek Kowalski	54edb97b3a	refactoring: renamed repo/block to repo/content Also introduced strongly typed content.ID and manifest.ID (instead of string) This aligns identifiers across all layers of repository: blob.ID content.ID object.ID manifest.ID	2019-06-01 22:24:19 -07:00

12 Commits