The kopia server was not uploading any logs to the repository,
because "repodiag" blob uploads would always fail.
The cause was the following: when the (log) repodiag blob
PUT operation was initiated, the `Context` used for this
operation was already canceled.
The context used for blob uploads is passed to
`repodiag.NewLogManager` when opening the repository.
In the case of the kopia server, the repository is asynchronously
opened in `server.Server.InitRepositoryAsync`. The context
passed to `repo.Open` is canceled after the "open repository"
server task completes.
This issue was introduced in #1691
Change:
Use `context.WithoutCancel()` instead of the context passed
when the repo diagnoser is created (see the sketch below).
New tests are included to reproduce this failure and verify
the fix.
- test: ensure server logs are uploaded to the repo
- test: honor cancellation in map storage
- test: repodiag context cancellation
Ref:
- #1691
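A minimal sketch of the change, assuming Go 1.21+'s `context.WithoutCancel` and a simplified `NewLogManager` signature (the real one differs):

```go
package repodiag

import (
	"context"

	"github.com/kopia/kopia/repo/blob"
)

// Manager is a simplified stand-in for the real log manager; only the parts
// relevant to the fix are shown.
type Manager struct {
	ctx context.Context // context used for background blob uploads
	st  blob.Storage
}

// NewLogManager detaches the upload context from the caller's cancellation:
// context.WithoutCancel (Go 1.21+) keeps the context's values but is never
// canceled, so repodiag blob PUTs no longer fail once the "open repository"
// task finishes and its context is canceled.
func NewLogManager(ctx context.Context, st blob.Storage) *Manager {
	return &Manager{
		ctx: context.WithoutCancel(ctx),
		st:  st,
	}
}
```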
Added improved providervalidation logic which tests for the read-after-write
property between connections. The new test was failing before the change
and is now passing for Google Drive, OneDrive, and Dropbox.
* Rename UnsupportedBlobRetention struct
Rename this struct to DefaultProviderImplementation in preparation for
adding other simple "default" functionality to it.
* Add other functions to default provider
Add other simple function implementations to the default provider so
that other providers can just embed this to get basic behavior.
* Cleanup existing users of default provider
* Add default provider to remaining storage types
Add the default provider to remaining storage providers and remove
functions that are now implemented by the default provider.
* Add new blob.Storage call to see if it's readonly
Return whether the storage is read-only so higher layers in the stack can
selectively disable some functionality if needed, like compaction
(see the sketch below).
Co-authored-by: Julio Lopez <1953782+julio-lopez@users.noreply.github.com>
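A rough sketch of the kind of accessor described above; the method name and its placement on the default provider are assumptions, not the actual API:

```go
package blob

// DefaultProviderImplementation supplies no-op defaults for optional
// blob.Storage behaviors; concrete providers embed it.
type DefaultProviderImplementation struct{}

// IsReadonly reports whether the storage is read-only, letting higher layers
// in the stack selectively disable functionality such as compaction.
// Embedders get this writable-by-default answer unless they override it.
func (DefaultProviderImplementation) IsReadonly() bool {
	return false
}
```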
* Store and return retention info in test storage
Add a new interface and function that allows getting retention
information during testing. This allows for more exact comparisons of
retention duration and mode in tests.
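A hypothetical shape for that testing hook (the interface and field names below are assumptions for illustration, not the actual code):

```go
package blobtesting

import (
	"context"
	"time"

	"github.com/kopia/kopia/repo/blob"
)

// RetentionInfo captures the retention settings a test expects to see on a blob.
type RetentionInfo struct {
	Mode   string        // retention mode, e.g. "GOVERNANCE" or "COMPLIANCE"
	Period time.Duration // how long the blob is locked
}

// RetentionReporter is implemented by test storage backends that can report
// the retention currently applied to a blob, so tests can make exact
// assertions about duration and mode instead of probing behavior indirectly.
type RetentionReporter interface {
	GetRetention(ctx context.Context, id blob.ID) (RetentionInfo, error)
}
```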
* Fixup how blobtesting retention extension works
Use the clock instead of the object's mod time so that extensions are
from the "current time." This aligns with how the S3 blob storage
functions.
* Update retention tests to use more precise checks
Where possible, use the information returned by GetRetention in tests
that deal with retention information. This allows for more precise
comparisons of retention duration and mode instead of indirectly testing
duration by advancing the clock and attempting to modify blobs.
* More robust error comparisons in retention tests
Update tests for retention to use `ErrorIs` checks instead of comparing
error messages.
* Use `require.NoError` in retention tests
Minor cleanup to reduce branches in code by using `require.NoError`
instead of if-blocks and `t.Fatal`.
* Implement ability to extend retention time on S3 buckets using Object Locks
* Move object-lock extension to maintenance.Params.
* Use a default function for unsupported extensions instead of duplicating code
* Fix potential lockup during object-lock extension
* Fix race condition. Add more code coverage
* rebase to V3
* Add checks to prevent user from setting Retention Period < Full Maintenance Interval
---------
Co-authored-by: Ashlie Martinez <ashmrtnz@alcion.ai>
* feat(repository): live cache eviction for persistent lru content cache
* Update internal/cache/persistent_lru_cache.go
Co-authored-by: Ali Dowair <adowair@umich.edu>
* merge the mutex cache into list cache
---------
Co-authored-by: Shikhar Mall <small@kopia.io>
Co-authored-by: Ali Dowair <adowair@umich.edu>
From https://github.com/google/gvisor/tree/master/tools/checklocks
This will perform static verification that we're using
`sync.Mutex`, `sync.RWMutex` and `atomic` correctly to guard access
to certain fields.
This was mostly just a matter of adding annotations to indicate which
fields are guarded by which mutex.
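For example, fields are annotated roughly like this (a minimal illustration of the checklocks annotation style):

```go
package example

import "sync"

// counter shows the annotation style: the analyzer statically verifies that
// every access to `value` happens while `mu` is held.
type counter struct {
	mu sync.Mutex

	// +checklocks:mu
	value int
}

func (c *counter) Increment() {
	c.mu.Lock()
	defer c.mu.Unlock()

	c.value++ // OK: mu is held for this access
}
```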
In a handful of places the code had to be refactored so the static
analyzer could do its job better, or to avoid confusing it with certain
constructs.
In one place this actually uncovered a bug where a function was not
releasing a lock properly in an error case.
The check is part of `make lint` but can also be invoked by
`make check-locks`.
* Introduce Volume sub-interface
The Volume interface defines APIs to access a storage provider's
volume (disk) capacity, usage, etc. It is inherited by the Storage
interface, and is at the same hierarchical level as the Reader
interface.
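A sketch of the shape described above (the `Capacity` field names and the error wording are assumptions, not the exact definitions):

```go
package blob

import (
	"context"
	"errors"
)

// ErrNotAVolume is returned by providers with no notion of a finite volume
// (cloud object stores, WebDAV).
var ErrNotAVolume = errors.New("storage is not backed by a volume")

// Capacity describes the size and usage of the volume backing a provider.
type Capacity struct {
	SizeBytes uint64 // total capacity of the volume, in bytes
	FreeBytes uint64 // space still available on the volume, in bytes
}

// Volume is the sub-interface inherited by Storage: volume-backed providers
// report capacity/usage, everything else returns ErrNotAVolume.
type Volume interface {
	GetCapacity(ctx context.Context) (Capacity, error)
}
```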
* Add validations for new Volume method:
Check that GetCapacity() either returns `ErrNotAVolume`, or that it
returns a Capacity struct with values that make sense.
* Implement default (passthrough) GetCapacity:
Cloud providers do not have finite volumes, and WebDAV volumes have no
notion of volume size and usage. These implementations should just
return an error (ErrNotAVolume) when their GetCapacity() is called.
* Implement GetCapacity for sftp storage: Uses the sftp.Client interface
* Implement GetCapacity for logging, readonly store
* Implement GetCapacity() for blobtesting implementations
* Implement GetCapacity() for Google Drive:
Also modifies GetDriveClient to return the entire service instead of just the Files client.
* Implemented GetCapacity() for filesystem storage:
Implemented the function in a separate file for each OS/architecture (Unix, OpenBSD, Windows).
Also simplified the validation test suite: it now simply tests whether
the provider supports DoNotRecreate or properly rejects it, without
external configuration.
* Add a new PutBlob option and blob error type
When `DoNotRecreate` is set to true, the blob put operation should
only succeed if no blob with the given blob ID already exists.
Otherwise, `ErrBlobAlreadyExists` is returned.
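A minimal sketch of how a provider without a native conditional-create primitive might honor the option (the existence check shown is illustrative and racy; real backends should use whatever atomic create-if-absent operation they offer):

```go
package example

import (
	"context"

	"github.com/kopia/kopia/repo/blob"
)

// checkedStorage wraps a base provider and enforces DoNotRecreate semantics.
type checkedStorage struct {
	blob.Storage
}

// PutBlob refuses to overwrite an existing blob when DoNotRecreate is set.
func (s checkedStorage) PutBlob(ctx context.Context, id blob.ID, data blob.Bytes, opts blob.PutOptions) error {
	if opts.DoNotRecreate {
		if _, err := s.Storage.GetMetadata(ctx, id); err == nil {
			return blob.ErrBlobAlreadyExists
		}
	}

	return s.Storage.PutBlob(ctx, id, data, opts)
}
```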
* Validate default storage providers' support
By default, storage providers should not support idempotent creates.
This commit adds error handling to exit early if `DoNotRecreate` is
set to true. The commit also verifies this behavior in the provider
validation test.
* Implement support for new option in GCS storage
* Push PutBlob option handling down to Impl
When PutBlob options were introduced, error handling logic for them
was implemented for the Sharded storage interface. However, the
behavior of different providers that implement Sharded can be
different, so it's better to push the options down to be processed in
the provider implementations.
* Introduce new error type for unsupported put opts
To unify error handling code and make it more maintainable, introduce
a new error type `blob.ErrUnsupportedPutBlobOption`, which is to be
returned whenever a storage provider implementation is given put
options it does not support.
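Roughly, a provider rejects options it cannot honor like this (a sketch; only the DoNotRecreate option named above is checked):

```go
package example

import "github.com/kopia/kopia/repo/blob"

// rejectUnsupportedPutOptions shows the shared pattern: a provider that
// implements none of the optional put behaviors returns the common
// blob.ErrUnsupportedPutBlobOption so callers see consistent errors
// regardless of backend.
func rejectUnsupportedPutOptions(opts blob.PutOptions) error {
	if opts.DoNotRecreate {
		// This provider has no conditional-create primitive.
		return blob.ErrUnsupportedPutBlobOption
	}

	return nil
}
```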
* feat: persisting retention options in repository blob
- plumb retention parameters through wrapped storage
- generalize aes encryption mechanism
- rewrite the retention blob on password change
- do not write retention blob when empty
* handle retention-blob not-found failures
* cli params to set retention modes on repository create
* enable versioned map mock storage with retention settings
* adding unit tests
* write format and retention blob with retention settings if available
* rename certain functions and constants specific to format blob
* delete retention cache on password-change
* fix: replace SetTime() api call with TouchBlob()
* Update repo/repository_test.go
Co-authored-by: Nick <nick@kasten.io>
* pr feedback and codecov improvements
* fix: rename retention-blob structures to generic blob-cfg
* fix: remove minio dependency on retention constants
Co-authored-by: Shikhar Mall <shikhar@kasten.io>
Co-authored-by: Nick <nick@kasten.io>
* sharded: plumbed through blob.PutOptions
* blob: removed blob.Storage.SetTime() method
This was only used for `kopia repo sync-to` and got replaced with
an equivalent blob.PutOptions.SetTime which, when set to a non-zero time,
will attempt to set the modification time on a file.
Since some providers don't support changing the modification time, we
are able to emulate it using per-blob metadata (on B2, Azure and GCS);
sadly, S3 is still unsupported because it does not support returning
metadata in list results.
Also added PutOptions.GetTime which, when set to a non-nil pointer, will
populate the provided variable with the actual time assigned to the blob
(see the sketch below).
Added tests that verify that each provider supports GetTime
and SetTime according to this spec.
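A usage sketch following the option names above (SetTime/GetTime); the exact field types are assumptions:

```go
package example

import (
	"context"
	"time"

	"github.com/kopia/kopia/repo/blob"
)

// syncBlob illustrates the `repo sync-to` use case: request a specific
// modification time and read back the time the provider actually assigned
// (which may differ on providers that emulate SetTime via metadata).
func syncBlob(ctx context.Context, st blob.Storage, id blob.ID, data blob.Bytes, srcModTime time.Time) (time.Time, error) {
	var assigned time.Time

	err := st.PutBlob(ctx, id, data, blob.PutOptions{
		SetTime: srcModTime, // attempt to preserve the source blob's mod time
		GetTime: &assigned,  // receive the timestamp actually assigned
	})

	return assigned, err
}
```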
* blob: additional test coverage for filesystem storage
* blob: added PutBlobAndGetMetadata() helper and used where appropriate
* fixed test failures
* pr feedback
* Update repo/blob/azure/azure_storage.go
Co-authored-by: Shikhar Mall <mall.shikhar.in@gmail.com>
* Update repo/blob/filesystem/filesystem_storage.go
Co-authored-by: Shikhar Mall <mall.shikhar.in@gmail.com>
* Update repo/blob/filesystem/filesystem_storage.go
Co-authored-by: Shikhar Mall <mall.shikhar.in@gmail.com>
* blobtesting: fixed object_locking_map.go
* blobtesting: removed SetTime from ObjectLockingMap
Co-authored-by: Shikhar Mall <mall.shikhar.in@gmail.com>
* blob: changed default shards from {3,3} to {1,3}
It turns out that for a very large repository of around 100TB (5M blobs),
we end up creating up to ~16M directories, which is far too many
and slows down listing. Currently each leaf directory only holds a handful
of files.
Simpler sharding of {3} should work much better and will end up creating
directories with meaningful shard sizes: 12K files per directory
should not be too slow, and this reduces the overhead of listing by
a factor of 4096.
The change is done in a backwards-compatible way: it respects a
custom sharding (.shards) file written by previous 0.9 builds,
as well as older repositories that don't have a .shards file (which
we assume to use {3,3}).
* fixed compat tests
The dual time measurement is described in
https://go.googlesource.com/proposal/+/master/design/12914-monotonic.md
The fix is to discard the hidden monotonic time component of time.Time
by converting to Unix time and back.
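Concretely, the conversion is just (a minimal sketch):

```go
package example

import "time"

// stripMonotonic discards the hidden monotonic clock reading carried by
// values from time.Now(), so that later arithmetic measures wall-clock time
// (which jumps across system sleep) rather than monotonic time.
func stripMonotonic(t time.Time) time.Time {
	// Round-tripping through Unix time drops the monotonic component.
	return time.Unix(0, t.UnixNano())
}
```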
Reviewed usage of clock.Now() and replaced with timetrack.StartTimer()
when measuring time.
The problem in #1402 was that the passage of time was measured using
monotonic time and not wall-clock time. When the computer goes
to sleep, monotonic time remains monotonic, while wall-clock time makes
a leap when the computer wakes up. This is the behavior that the
epoch manager (and most other components in Kopia) relies upon.
Fixes #1402
Co-authored-by: Julio Lopez <julio+gh@kasten.io>
* refactor: move from io/ioutil to io and os package
The io/ioutil package has been deprecated as of Go 1.16, see
https://golang.org/doc/go1.16#ioutil. This commit replaces the existing
io/ioutil functions with their new definitions in io and os packages.
Signed-off-by: Eng Zer Jun <engzerjun@gmail.com>
* chore: remove //nolint:gosec for os.ReadFile
At the time of this commit, the G304 rule of gosec does not include the
`os.ReadFile` function. We remove `//nolint:gosec` temporarily until
https://github.com/securego/gosec/pull/706 is merged.
Signed-off-by: Eng Zer Jun <engzerjun@gmail.com>
* cli: added 'repository validate-provider' which runs a set of tests against a blob storage provider to validate it
This implements a provider test which exercises subtle behaviors that are not always correctly implemented by providers claiming compatibility with S3, for example.
The test checks:
- not found behavior
- prefix scans
- timestamps
- write atomicity
* retry: improved error message on failure
* rclone: fixed stats reporting and awaiting for completion
* webdav: prevent panic when attempting to mkdir with empty name
* testing: run providervalidation.ValidateProvider as part of regular provider tests
* cli: print a recommendation to validate provider after repository creation
* testing: removed testutil.Retry because all providers now have internal retries
* testing: simplified and unified cleanup for all cloud providers using shared buckets
* added a framework for unit testing against real remote rclone remotes,
added a Google Drive backend
* added parallelism to blobtesting which revealed some races during
PutBlob with WebDAV.
* content: fixed time-based auto-flush behavior to behave like Flush()
Previously it was sometimes possible for a content whose write
started before a time-based flush to finish writing afterwards (and it
would be included in the new index).
Refactored the code so that the time-based flush happens before the
WriteContent write and behaves exactly the same way as a real Flush(),
so all writes started before it will be awaited during the flush.
Also, the previous regression test was incorrect since it was mocking the
wrong blob method.
* content: refactored index blob manager crypto to separate file
This will be reused for encrypting session info.
* content: added support for session markers
A session marker (`s` blob) is written BEFORE the first data blob
(`p` or `q`) that belongs to the new index segment (`n`) is written.
The session marker is removed AFTER the index blob (`n`) has been written.
All pack and index blobs belonging to a session have the session
ID as a suffix, so that if a reader can see the `s<sessionID>` blob, it
will ignore any `p` and `q` blobs with the same suffix.
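Illustrative reader-side logic (a sketch only; the exact blob-ID layout and the `-s` separator are assumptions):

```go
package example

import "strings"

// shouldIgnoreBlob reports whether a pack (`p`/`q`) blob should be skipped
// because the session that wrote it is still in progress, i.e. its
// `s<sessionID>` marker blob is still visible. activeSessionIDs is keyed by
// the session ID seen in the `s` marker blobs.
func shouldIgnoreBlob(blobID string, activeSessionIDs map[string]bool) bool {
	// Blobs written within a session carry the session ID as a suffix;
	// split it off and check whether a live `s` marker exists for it.
	idx := strings.LastIndex(blobID, "-s")
	if idx < 0 {
		return false // not written within a session
	}

	return activeSessionIDs[blobID[idx+2:]]
}
```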
* maintenance: ignore blobs belonging to active sessions when running blob garbage collection
* cli: added 'sessions list' for listing active sessions
* content: added retrying writing previously failed blobs before writing new one
* linter: upgraded to 1.33, disabled some linters
* lint: fixed 'errorlint' errors
This ensures that all error comparisons use errors.Is() or errors.As().
We will be wrapping more errors going forward, so it's important that
error checks do not rely on strict equality anywhere.
Verified that there are no exceptions configured for the errorlint
linter, which guarantees that.
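For example (using a sentinel from the blob package):

```go
package example

import (
	"errors"
	"fmt"

	"github.com/kopia/kopia/repo/blob"
)

// handleGetError shows the pattern errorlint enforces: wrapped errors must be
// matched with errors.Is()/errors.As() rather than ==, because the sentinel
// may be buried under one or more layers of wrapping.
func handleGetError(err error) error {
	if errors.Is(err, blob.ErrBlobNotFound) {
		return nil // a missing blob is not fatal in this example
	}

	return fmt.Errorf("unexpected storage error: %w", err)
}
```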
* lint: fixed or suppressed wrapcheck errors
* lint: nolintlint and misc cleanups
Co-authored-by: Julio López <julio+gh@kasten.io>