kopia

mirror of https://github.com/kopia/kopia.git synced 2026-05-11 00:04:46 -04:00

Author	SHA1	Message	Date
Matthieu MOREL	8a176255c0	fix(general): enable wsl for all go files (#4524 ) Signed-off-by: Matthieu MOREL <matthieu.morel35@gmail.com>	2025-04-26 13:01:20 -07:00
Prasad Ghangal	3bf947d746	feat(repository): Metadata compression config support for directory and indirect content (#4080 ) * Configure compressor for k and x prefixed content Adds metadata compression setting to policy Add support to configure compressor for k and x prefixed content Set zstd-fastest as the default compressor for metadata in the policy Adds support to set and show metadata compression to kopia policy commands Adds metadata compression config to dir writer Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> * Pass concatenate options with ConcatenateOptions struct Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> * Move content compression handling to caller Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> * Move handling manifests to manifest pkg Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> * Correct const in server_test Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> * Remove unnecessary whitespace Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> * Disable metadata compression for < V2 format Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> --------- Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com>	2024-10-23 23:28:23 -07:00
Julio López	961a39039b	refactor(general): use `errors.New` where appropriate (#4160 ) Replaces 'errors.Errorf\("([^"]+)"\)' => 'errors.New("\1")'	2024-10-05 19:05:00 -07:00
Jarek Kowalski	e36fa78385	feat(snapshots): added support for per-directory splitter overrides (#3887 ) This is useful when backing up directories that have giant files aligned at MiB boundary, such as VM disk backups, etc.	2024-06-07 13:42:15 -07:00
Jarek Kowalski	09415e0c7d	chore(ci): upgraded to go 1.22 (#3746 ) Upgrades go to 1.22 and switches to new-style for loops --------- Co-authored-by: Julio López <1953782+julio-lopez@users.noreply.github.com>	2024-04-08 09:52:47 -07:00
Jarek Kowalski	d0fc1e03c4	fix(server): do not make blocking calls inside server status API (#3666 ) also reduce global server lock scope	2024-02-21 12:34:16 -08:00
Jarek Kowalski	524ffaf4b8	refactor(repository): added context to potentially blocking repository methods (#3654 ) Primarily for wiring a context.Context to a call to content.Manager.refresh, which was using a detached context.	2024-02-20 14:48:23 -08:00
Jarek Kowalski	a8e4d50600	build(deps): upgraded linter to v1.55.2, fixed warnings (#3611 ) * build(deps): upgraded linter to v1.55.2, fixed warnings * removed unsafe hacks with better equivalents * test fixes	2024-02-02 23:34:34 -08:00
Jarek Kowalski	7ee30b76bb	fix(repository): fixed handling of content.Info (#3356 ) * fix(repository): fixed handling of content.Info Previously content.Info was an interface which was implemented by: * index.InfoStruct * index.indexEntryInfoV1 * index.indexEntryInfoV2 The last 2 implementations were relying on memory-mapped files which in rare cases could be closed while Kopia was still processing them leading to #2599. This changes fixes the bug and strictly separates content.Info (which is now always a struct) from the other two (which were renamed as index.InfoReader and only used inside repo/content/...). In addition to being safer, this _should_ reduce memory allocations. * reduce the size of content.Info with proper alignment. * pr feedback * renamed index.InfoStruct to index.Info	2023-10-14 10:34:15 -07:00
Julio Lopez	a99e38c247	fix(lint): remove uses of deprecated rand.Read (#2858 ) Lint fixes in preparation for moving to Go 1.20 Remove deprecated calls to `rand.Seed` In Go 1.20 the default generator is seeded randomly at program startup, which is the desired behavior for these tests. Remove uses of deprecated rand.Read: replace with calls to rand.Uint64() Remove deprecated uses of rand.Read in content manager tests and S3 versioned tests. Adds a concurrency-safe helpers to provide functionality similar to that provided by `rand.Read(b []byte) (int, error)`	2023-03-28 01:44:09 +00:00
Jarek Kowalski	e57020fb70	test(repository): server testability refactoring (#2612 ) - removed repo.OpenAPIServer() which was only needed for testability - introduced servertesting package to replace it	2022-12-01 06:27:52 +00:00
Jarek Kowalski	78edd92692	refactor(repository): refactored Prometheus metrics (#2532 ) This may be a breaking change for users who rely on particular kopia metrics (unlikely): - introduced blob-level metrics: * `kopia_blob_download_full_blob_bytes_total` * `kopia_blob_download_partial_blob_bytes_total` * `kopia_blob_upload_bytes_total` * `kopia_blob_storage_latency_ms` - per-method latency distribution * `kopia_blob_errors_total` - per-method error counter - updated cache metrics to indicate particular cache * `kopia_cache_hit_bytes_total{cache="CACHE_TYPE"}` * `kopia_cache_hit_total{cache="CACHE_TYPE"}` * `kopia_cache_malformed_total{cache="CACHE_TYPE"}` * `kopia_cache_miss_total{cache="CACHE_TYPE"}` * `kopia_cache_miss_errors_total{cache="CACHE_TYPE"}` * `kopia_cache_miss_bytes_total{cache="CACHE_TYPE"}` * `kopia_cache_store_errors_total{cache="CACHE_TYPE"}` where `CACHE_TYPE` is one of `contents`, `metadata` or `index-blobs` - reorganized and unified content-level metrics: * `kopia_content_write_bytes_total` * `kopia_content_write_duration_nanos_total` * `kopia_content_compression_attempted_bytes_total` * `kopia_content_compression_attempted_duration_nanos_total` * `kopia_content_compression_savings_bytes_total` * `kopia_content_compressible_bytes_total` * `kopia_content_non_compressible_bytes_total` * `kopia_content_after_compression_bytes_total` * `kopia_content_decompressed_bytes_total` * `kopia_content_decompressed_duration_nanos_total` * `kopia_content_encrypted_bytes_total` * `kopia_content_encrypted_duration_nanos_total` * `kopia_content_hashed_bytes_total` * `kopia_content_hashed_duration_nanos_total` * `kopia_content_deduplicated_bytes_total` * `kopia_content_read_bytes_total` * `kopia_content_read_duration_nanos_total` * `kopia_content_decrypted_bytes_total` * `kopia_content_decrypted_duration_nanos_total` * `kopia_content_uploaded_bytes_total` Also introduced `internal/metrics` framework which constructs Prometheus metrics in a uniform way and will allow us to include some of these metrics in telemetry report in future PRs.	2022-11-10 05:30:06 +00:00
Jarek Kowalski	51dcaa985d	chore(ci): upgraded linter to 1.48.0 (#2294 ) Mechanically fixed all issues, added `lint-fix` make target.	2022-08-09 06:07:54 +00:00
Jarek Kowalski	23299c3451	refactor(repository): ensure MutableParameters are never cached (#2284 )	2022-08-06 18:11:32 -07:00
Jarek Kowalski	6160ee5668	refactor(repository): moved format blob management to separate package (#2245 ) * refactor(repository): moved format blob management to separate package This is completely mechanical, no behavior changes, only: - moved types and functions to a new package - adjusted visibility where needed - added missing godoc - renamed some identifiers to align with current usage - mechanically converted some top-level functions into member functions - fixed some mis-named variables * refactor(repository): moved content.FormatingOptions to format.ContentFormat	2022-07-30 14:13:52 -07:00
Jarek Kowalski	9bf9cac7fb	refactor(repository): ensure we always parse content.ID and object.ID (#1960 ) * refactor(repository): ensure we always parse content.ID and object.ID This changes the types to be incompatible with string to prevent direct conversion to and from string. This has the additional benefit of reducing number of memory allocations and bytes for all IDs. content.ID went from 2 allocations to 1: typical case 32 characters + 16 bytes per-string overhead worst-case 65 characters + 16 bytes per-string overhead now: 34 bytes object.ID went from 2 allocations to 1: typical case 32 characters + 16 bytes per-string overhead worst-case 65 characters + 16 bytes per-string overhead now: 36 bytes * move index.{ID,IDRange} methods to separate files * replaced index.IDFromHash with content.IDFromHash externally * minor tweaks and additional tests * Update repo/content/index/id_test.go Co-authored-by: Julio Lopez <1953782+julio-lopez@users.noreply.github.com> * Update repo/content/index/id_test.go Co-authored-by: Julio Lopez <1953782+julio-lopez@users.noreply.github.com> * pr feedback * post-merge fixes * pr feedback * pr feedback * fixed subtle regression in sortedContents() This was actually not producing invalid results because of how base36 works, just not sorting as efficiently as it could. Co-authored-by: Julio Lopez <1953782+julio-lopez@users.noreply.github.com>	2022-05-25 14:15:56 +00:00
Jarek Kowalski	daa62de3e4	chore(ci): added checklocks static analyzer (#1838 ) From https://github.com/google/gvisor/tree/master/tools/checklocks This will perform static verification that we're using `sync.Mutex`, `sync.RWMutex` and `atomic` correctly to guard access to certain fields. This was mostly just a matter of adding annotations to indicate which fields are guarded by which mutex. In a handful of places the code had to be refactored to allow static analyzer to do its job better or to not be confused by some constructs. In one place this actually uncovered a bug where a function was not releasing a lock properly in an error case. The check is part of `make lint` but can also be invoked by `make check-locks`.	2022-03-19 22:42:59 -07:00
Jarek Kowalski	69dc7ba969	feat(repository): added 'hint' to Prefetch methods. (#1825 )	2022-03-12 23:16:39 -08:00
Jarek Kowalski	db7dcac33c	feat(repo): exposed PrefetchContents on the repo.Repository() (#1824 )	2022-03-12 14:36:54 -08:00
Jarek Kowalski	e67f84e0ba	chore(general): updated linter to 1.44.0 (#1681 )	2022-01-25 21:21:13 -08:00
Jarek Kowalski	bbbef44d8a	More coverage improvements (#1577 ) * increased direct coverage for internal/cache * object: code coverage improvements for object writer	2021-12-11 23:27:42 -08:00
Eng Zer Jun	73e492c9db	refactor: move from io/ioutil to io and os package (#1360 ) * refactor: move from io/ioutil to io and os package The io/ioutil package has been deprecated as of Go 1.16, see https://golang.org/doc/go1.16#ioutil. This commit replaces the existing io/ioutil functions with their new definitions in io and os packages. Signed-off-by: Eng Zer Jun <engzerjun@gmail.com> * chore: remove //nolint:gosec for os.ReadFile At the time of this commit, the G304 rule of gosec does not include the `os.ReadFile` function. We remove `//nolint:gosec` temporarily until https://github.com/securego/gosec/pull/706 is merged. Signed-off-by: Eng Zer Jun <engzerjun@gmail.com>	2021-10-06 08:39:10 -07:00
Jarek Kowalski	792cc874dc	repo: allow reusing of object writer buffers (#1315 ) This reduces memory consumption and speeds up backups. 1. Backing up kopia repository (3.5 GB files:133102 dirs:20074): before: 25s, 490 MB after: 21s, 445 MB 2. Large files (14.8 GB, 76 files) before: 30s, 597 MB after: 28s, 495 MB All tests repeated 5 times for clean local filesystem repo.	2021-09-25 14:54:31 -07:00
Jarek Kowalski	35d0f31c0d	huge: replaced the use of allocated byte slices with populating gather.WriteBuffer in the repository (#1244 ) This helps recycle buffers more efficiently during snapshots. Also, improved memory tracking, enabled profiling flags and added pprof by default.	2021-08-20 08:45:10 -07:00
Jarek Kowalski	40510c043d	Support for content-level compression (#1076 ) * cli: added a flag to create repository with v2 index features * content: plumb through compression.ID parameter to content.Manager.WriteContent() * content: expose content.Manager.SupportsContentCompression This allows object manager to decide whether to create compressed object or let the content manager do it. * object: if compression is requested and the repo supports it, pass compression ID to the content manager * cli: show compression status in 'repository status' * cli: output compression information in 'content list' and 'content stats' * content: compression and decompression support * content: unit tests for compression * object: compression tests * testing: added integration tests against v2 index * testing: run all e2e tests with and without content-level compression * htmlui: added UI for specifying index format on creation * cli: additional tests for 'content ls' and 'content stats' * applied pr suggestions	2021-05-22 05:35:27 -07:00
Jarek Kowalski	df430371b9	Refactored content.Info to be an interface and switched index parsing to be lazy (#1008 )	2021-04-27 05:53:52 -07:00
Jarek Kowalski	2062c07259	mechanical field renames (#988 ) * content: mechanical rename content.Info.Length -> content.Info.PackedLength * server: renamed grpc API ContentInfo.length->packed_length (non-breaking)	2021-04-16 22:42:32 -07:00
Jarek Kowalski	7c108930ef	testing: ensure tests are releasing all buffer pools to reduce memory usage, we had huge leaks (#895 ) * testing: ensure tests are releasing all buffer pools to reduce memory usage, we had huge leaks * object: reduced complexity and memory usage of TestEndToEndReadAndSeekWithCompression * manifest: more test fixes * trivial: update comment Co-authored-by: Julio López <julio+gh@kasten.io>	2021-03-18 06:40:33 -07:00
Jarek Kowalski	e2b9a81ac3	Major CI/CD refactoring and re-added support for ARM/ARM64 runners (#849 ) * ci: refactored CI/CD logic & Makefile - removed all travis CI emulation environment variables and replaced with: CI_TAG=<empty>\|tag IS_PULL_REQUEST=false\|true - refactored all OS and architecture-specific decisions to use around standard GOOS/GOARCH values instead of uname/OS - re-added self-hosted runner for ARMHF (3 replicas) - added brand new self-hosted runner for ARM64 (3 replicas) - disabled attempts to publish and sign on forks - improved integration test log output to better see timings and sub-tests - print longest tests (unit tests and integration) after each run - verified that all configurations build successfully on a clone (jkowalski/kopia) - run make setup in parallel * testing: fixed tests on ARM and ARM64 - fixed ARM-specific alignment issue - cleaned up test logging - fixed huge params warning threshold because it was tripping on ARM. - reduced test complexity to make them fit in 15 minutes	2021-02-23 00:52:54 -08:00
Jarek Kowalski	1f3b8d4da4	upgrade linter to 1.35 (#786 ) * lint: added test that enforces Makefile and GH action linter versions are in sync * workaround for linter gomnd problem - https://github.com/golangci/golangci-lint/issues/1653	2021-01-16 18:21:16 -08:00
Jarek Kowalski	ed9db56b87	Cleaned up and refactored object manager (#782 ) * object: refactored Open() and VerifyObject() to be stateless (no code movement yet to facilitate review) * mechanical: moved function more appropriate files * object: remove object manager tracing which was unused	2021-01-13 00:40:23 -08:00
Jarek Kowalski	e03971fc59	Upgraded linter to v1.33.0 (#734 ) * linter: upgraded to 1.33, disabled some linters * lint: fixed 'errorlint' errors This ensures that all error comparisons use errors.Is() or errors.As(). We will be wrapping more errors going forward so it's important that error checks are not strict everywhere. Verified that there are no exceptions for errorlint linter which guarantees that. * lint: fixed or suppressed wrapcheck errors * lint: nolintlint and misc cleanups Co-authored-by: Julio López <julio+gh@kasten.io>	2020-12-21 22:39:22 -08:00
Jarek Kowalski	66cebb79cb	Fixed empty object IDs in checkpoints (#649 ) * object: fixed race condition between Result() and Checkpoint() This would sometimes result in indirect objects having empty object IDs. Fixes #648 * upload: ensure checkpoints never containt empty object IDs. * testing: reduce armhf test weight	2020-09-29 07:14:47 -07:00
Jarek Kowalski	f0b97b960b	Fixed checkpointing to not restart the entire upload process (#594 ) * object: added Checkpoint() method to object writer * upload: refactored code structure to allow better checkpointing * upload: removed Checkpoint() method from UploadProgress * Update fs/entry.go Co-authored-by: Julio López <julio+gh@kasten.io>	2020-09-12 22:36:22 -07:00
Jarek Kowalski	e22d22dba2	object: implemented fast concatenation of objects by merging their index entries (#607 )	2020-09-11 20:12:01 -07:00
Jarek Kowalski	9a6dea898b	Linter upgrade to v1.30.0 (#526 ) * fixed godot linter errors * reformatted source with gofumpt * disabled some linters * fixed nolintlint warnings * fixed gci warnings * lint: fixed 'nestif' warnings * lint: fixed 'exhaustive' warnings * lint: fixed 'gocritic' warnings * lint: fixed 'noctx' warnings * lint: fixed 'wsl' warnings * lint: fixed 'goerr113' warnings * lint: fixed 'gosec' warnings * lint: upgraded linter to 1.30.0 * lint: more 'exhaustive' warnings Co-authored-by: Nick <nick@kasten.io>	2020-08-12 19:28:53 -07:00
Jarek Kowalski	40acf238f3	Fixed arm and arm64 build. (#506 ) * fixed a number of cases where misaligned data was causing panics on armv7 (but not armv8) * travis: enable arm64 * test: reduce compressed data sizes when running on arm * arm: wait longer for snapshots	2020-07-30 17:31:28 -07:00
Jarek Kowalski	573d10422a	object: ensure that all I objects have a content prefix When prefix is not specified on ObjectWriter, we force 'x' content prefix on intermediate contents, so object IDs will look like: Ix{hash} This ensures the index contents will be stored in `q` blobs, making `snapshot gc` easier.	2020-04-12 23:55:09 -07:00
Jarek Kowalski	8687f1c008	object: added AsyncWrites to ObjectWriter, which improves performance… (#369 ) * object: added AsyncWrites to ObjectWriter, which improves performance of uploading of a single file Fixes #351 Co-Authored-By: Julio López <julio+gh@kasten.io>	2020-03-22 09:02:33 -07:00
Jarek Kowalski	d181403284	crypto: refactored encryption, hashing and splitter into separate packages (#274 ) Added some tests, deleted XSALSA20 which never worked E2E	2020-02-27 12:36:49 -08:00
Jarek Kowalski	c8fcae93aa	logging: refactored logging This is mostly mechanical and changes how loggers are instantiated. Logger is now associated with a context, passed around all methods, (most methods had ctx, but had to add it in a few missing places). By default Kopia does not produce any logs, but it can be overridden, either locally for a nested context, by calling ctx = logging.WithLogger(ctx, newLoggerFunc) To override logs globally, call logging.SetDefaultLogger(newLoggerFunc) This refactoring allowed removing dependency from Kopia repo and go-logging library (the CLI still uses it, though). It is now also possible to have all test methods emit logs using t.Logf() so that they show up in failure reports, which should make debugging of test failures suck less.	2020-02-25 17:24:44 -08:00
Jarek Kowalski	0b8c4d0ef9	object: fixed compression bug where we were not clearing the buffer this effectively defeated the purpose of compression, caused high memory usage and other kinds of bad behavior. refactored the code to prevent this issue by resetting the buffer at the caller not callee. fixed previous e2e test to catch the issue mentioned in #166, verified it fails against master and passes with this change.	2020-01-09 16:36:57 -08:00
Jarek Kowalski	2ba4e83cef	moved all compression to separate package and sanitized identifiers	2019-12-10 23:25:28 -08:00
Jarek Kowalski	aec3cdcb2f	object: added support for compressed objects	2019-12-10 23:25:28 -08:00
Jarek Kowalski	cfa2eea1e1	object: stop verifying object length in VerifyObject With compression, the decompressed object length is not known until we read the content.	2019-12-10 23:25:28 -08:00
Jarek Kowalski	6217df1a87	lint: switched to 1.21 and fixed a ton of whitespace issues discovered by new wsl linter	2019-11-26 06:49:49 -08:00
Jarek Kowalski	f4afaec9b5	object: fixed Seek() behavior for seeking to end of object	2019-11-25 23:36:24 -08:00
Jarek Kowalski	72520029b0	golangci-lint: added more linters Also fixed pre-existing lint errors.	2019-06-02 22:56:57 -07:00
Jarek Kowalski	54edb97b3a	refactoring: renamed repo/block to repo/content Also introduced strongly typed content.ID and manifest.ID (instead of string) This aligns identifiers across all layers of repository: blob.ID content.ID object.ID manifest.ID	2019-06-01 22:24:19 -07:00
Jarek Kowalski	916da07e0f	deprecate block format v0	2019-06-01 16:40:51 -07:00

1 2

57 Commits