kopia

mirror of https://github.com/kopia/kopia.git synced 2026-05-11 00:04:46 -04:00

Author	SHA1	Message	Date
Julio Lopez	8098f49c90	chore(ci): remove exclusion for unused `ctx` parameters (#4530 ) Remove unused-parameter exclusion for `ctx` in revive linter. --------- Signed-off-by: Matthieu MOREL <matthieu.morel35@gmail.com> Co-authored-by: Matthieu MOREL <matthieu.morel35@gmail.com>	2025-04-26 23:11:36 -07:00
Prasad Ghangal	3bf947d746	feat(repository): Metadata compression config support for directory and indirect content (#4080 ) * Configure compressor for k and x prefixed content Adds metadata compression setting to policy Add support to configure compressor for k and x prefixed content Set zstd-fastest as the default compressor for metadata in the policy Adds support to set and show metadata compression to kopia policy commands Adds metadata compression config to dir writer Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> * Pass concatenate options with ConcatenateOptions struct Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> * Move content compression handling to caller Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> * Move handling manifests to manifest pkg Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> * Correct const in server_test Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> * Remove unnecessary whitespace Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> * Disable metadata compression for < V2 format Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com> --------- Signed-off-by: Prasad Ghangal <prasad.ganghal@veeam.com>	2024-10-23 23:28:23 -07:00
Julio López	961a39039b	refactor(general): use `errors.New` where appropriate (#4160 ) Replaces 'errors.Errorf\("([^"]+)"\)' => 'errors.New("\1")'	2024-10-05 19:05:00 -07:00
Jarek Kowalski	e36fa78385	feat(snapshots): added support for per-directory splitter overrides (#3887 ) This is useful when backing up directories that have giant files aligned at MiB boundary, such as VM disk backups, etc.	2024-06-07 13:42:15 -07:00
Jarek Kowalski	d0fc1e03c4	fix(server): do not make blocking calls inside server status API (#3666 ) also reduce global server lock scope	2024-02-21 12:34:16 -08:00
Jarek Kowalski	524ffaf4b8	refactor(repository): added context to potentially blocking repository methods (#3654 ) Primarily for wiring a context.Context to a call to content.Manager.refresh, which was using a detached context.	2024-02-20 14:48:23 -08:00
Jarek Kowalski	cbc66f936d	chore(ci): upgraded linter to 1.53.3 (#3079 ) * chore(ci): upgraded linter to 1.53.3 This flagged a bunch of unused parameters, so the PR is larger than usual, but 99% mechanical. * separate lint CI task * run Lint in separate CI	2023-06-18 13:26:01 -07:00
Edward Betts	1e97574391	fix(general): correct spelling mistakes (#2684 )	2023-01-21 07:37:15 -08:00
Jarek Kowalski	78edd92692	refactor(repository): refactored Prometheus metrics (#2532 ) This may be a breaking change for users who rely on particular kopia metrics (unlikely): - introduced blob-level metrics: * `kopia_blob_download_full_blob_bytes_total` * `kopia_blob_download_partial_blob_bytes_total` * `kopia_blob_upload_bytes_total` * `kopia_blob_storage_latency_ms` - per-method latency distribution * `kopia_blob_errors_total` - per-method error counter - updated cache metrics to indicate particular cache * `kopia_cache_hit_bytes_total{cache="CACHE_TYPE"}` * `kopia_cache_hit_total{cache="CACHE_TYPE"}` * `kopia_cache_malformed_total{cache="CACHE_TYPE"}` * `kopia_cache_miss_total{cache="CACHE_TYPE"}` * `kopia_cache_miss_errors_total{cache="CACHE_TYPE"}` * `kopia_cache_miss_bytes_total{cache="CACHE_TYPE"}` * `kopia_cache_store_errors_total{cache="CACHE_TYPE"}` where `CACHE_TYPE` is one of `contents`, `metadata` or `index-blobs` - reorganized and unified content-level metrics: * `kopia_content_write_bytes_total` * `kopia_content_write_duration_nanos_total` * `kopia_content_compression_attempted_bytes_total` * `kopia_content_compression_attempted_duration_nanos_total` * `kopia_content_compression_savings_bytes_total` * `kopia_content_compressible_bytes_total` * `kopia_content_non_compressible_bytes_total` * `kopia_content_after_compression_bytes_total` * `kopia_content_decompressed_bytes_total` * `kopia_content_decompressed_duration_nanos_total` * `kopia_content_encrypted_bytes_total` * `kopia_content_encrypted_duration_nanos_total` * `kopia_content_hashed_bytes_total` * `kopia_content_hashed_duration_nanos_total` * `kopia_content_deduplicated_bytes_total` * `kopia_content_read_bytes_total` * `kopia_content_read_duration_nanos_total` * `kopia_content_decrypted_bytes_total` * `kopia_content_decrypted_duration_nanos_total` * `kopia_content_uploaded_bytes_total` Also introduced `internal/metrics` framework which constructs Prometheus metrics in a uniform way and will allow us to include some of these metrics in telemetry report in future PRs.	2022-11-10 05:30:06 +00:00
Jarek Kowalski	51dcaa985d	chore(ci): upgraded linter to 1.48.0 (#2294 ) Mechanically fixed all issues, added `lint-fix` make target.	2022-08-09 06:07:54 +00:00
Jarek Kowalski	23299c3451	refactor(repository): ensure MutableParameters are never cached (#2284 )	2022-08-06 18:11:32 -07:00
Jarek Kowalski	6160ee5668	refactor(repository): moved format blob management to separate package (#2245 ) * refactor(repository): moved format blob management to separate package This is completely mechanical, no behavior changes, only: - moved types and functions to a new package - adjusted visibility where needed - added missing godoc - renamed some identifiers to align with current usage - mechanically converted some top-level functions into member functions - fixed some mis-named variables * refactor(repository): moved content.FormatingOptions to format.ContentFormat	2022-07-30 14:13:52 -07:00
Jarek Kowalski	68b8afd43f	feat(snapshots): improved performance when uploading huge files (#2064 ) * feat(snapshots): improved performance when uploading huge files This is controlled by an upload policy which specifies the size threshold above which indvidual files are uploaded in parts and concatenated. This allows multiple threads to run splitting, hashing, compression and encryption in parallel, which was previously only possible across multiple files, but not when a single file was being uploaded. The default is 2GiB for now, so this feature only kicks in for very larger files. In the future we may lower this. Benchmark involved uploading a single 42.1 GB file which was a VM disk snapshot of fresh Ubuntu installation (fresh EXT4 partition with lots of zero bytes) to a brand-new filesystem repository on local SSD of M1 Pro Macbook Pro 2021. * before: 59-63s (~700 MB/s) * after: 15-17s (~2.6 GB/s) * additional test to ensure files are really e2e readable	2022-06-24 07:38:07 +00:00
Jarek Kowalski	9bf9cac7fb	refactor(repository): ensure we always parse content.ID and object.ID (#1960 ) * refactor(repository): ensure we always parse content.ID and object.ID This changes the types to be incompatible with string to prevent direct conversion to and from string. This has the additional benefit of reducing number of memory allocations and bytes for all IDs. content.ID went from 2 allocations to 1: typical case 32 characters + 16 bytes per-string overhead worst-case 65 characters + 16 bytes per-string overhead now: 34 bytes object.ID went from 2 allocations to 1: typical case 32 characters + 16 bytes per-string overhead worst-case 65 characters + 16 bytes per-string overhead now: 36 bytes * move index.{ID,IDRange} methods to separate files * replaced index.IDFromHash with content.IDFromHash externally * minor tweaks and additional tests * Update repo/content/index/id_test.go Co-authored-by: Julio Lopez <1953782+julio-lopez@users.noreply.github.com> * Update repo/content/index/id_test.go Co-authored-by: Julio Lopez <1953782+julio-lopez@users.noreply.github.com> * pr feedback * post-merge fixes * pr feedback * pr feedback * fixed subtle regression in sortedContents() This was actually not producing invalid results because of how base36 works, just not sorting as efficiently as it could. Co-authored-by: Julio Lopez <1953782+julio-lopez@users.noreply.github.com>	2022-05-25 14:15:56 +00:00
Jarek Kowalski	69dc7ba969	feat(repository): added 'hint' to Prefetch methods. (#1825 )	2022-03-12 23:16:39 -08:00
Jarek Kowalski	db7dcac33c	feat(repo): exposed PrefetchContents on the repo.Repository() (#1824 )	2022-03-12 14:36:54 -08:00
Jarek Kowalski	926e14aacb	feat(repository): added PrefetchObjects() API (#1779 ) * feat(repository): added precaching of data blobs * feat(repository): added utilities for converting ID slices to strings * feat(repository): added object.PrefetchBackingContents * feat(repository): implemented Repository.PrefetchObjects * feat(cli): added 'cache prefetch' subcommand * feat(repository): prefetch in parallel * added tests	2022-03-06 14:30:58 -08:00
Jarek Kowalski	792cc874dc	repo: allow reusing of object writer buffers (#1315 ) This reduces memory consumption and speeds up backups. 1. Backing up kopia repository (3.5 GB files:133102 dirs:20074): before: 25s, 490 MB after: 21s, 445 MB 2. Large files (14.8 GB, 76 files) before: 30s, 597 MB after: 28s, 495 MB All tests repeated 5 times for clean local filesystem repo.	2021-09-25 14:54:31 -07:00
Jarek Kowalski	35d0f31c0d	huge: replaced the use of allocated byte slices with populating gather.WriteBuffer in the repository (#1244 ) This helps recycle buffers more efficiently during snapshots. Also, improved memory tracking, enabled profiling flags and added pprof by default.	2021-08-20 08:45:10 -07:00
Jarek Kowalski	40510c043d	Support for content-level compression (#1076 ) * cli: added a flag to create repository with v2 index features * content: plumb through compression.ID parameter to content.Manager.WriteContent() * content: expose content.Manager.SupportsContentCompression This allows object manager to decide whether to create compressed object or let the content manager do it. * object: if compression is requested and the repo supports it, pass compression ID to the content manager * cli: show compression status in 'repository status' * cli: output compression information in 'content list' and 'content stats' * content: compression and decompression support * content: unit tests for compression * object: compression tests * testing: added integration tests against v2 index * testing: run all e2e tests with and without content-level compression * htmlui: added UI for specifying index format on creation * cli: additional tests for 'content ls' and 'content stats' * applied pr suggestions	2021-05-22 05:35:27 -07:00
Jarek Kowalski	ed9db56b87	Cleaned up and refactored object manager (#782 ) * object: refactored Open() and VerifyObject() to be stateless (no code movement yet to facilitate review) * mechanical: moved function more appropriate files * object: remove object manager tracing which was unused	2021-01-13 00:40:23 -08:00
Jarek Kowalski	d52af7cebb	fixed cases where nil was passed to errors.Wrap() causing nil to be returned (#747 )	2020-12-24 20:35:29 -08:00
Jarek Kowalski	e03971fc59	Upgraded linter to v1.33.0 (#734 ) * linter: upgraded to 1.33, disabled some linters * lint: fixed 'errorlint' errors This ensures that all error comparisons use errors.Is() or errors.As(). We will be wrapping more errors going forward so it's important that error checks are not strict everywhere. Verified that there are no exceptions for errorlint linter which guarantees that. * lint: fixed or suppressed wrapcheck errors * lint: nolintlint and misc cleanups Co-authored-by: Julio López <julio+gh@kasten.io>	2020-12-21 22:39:22 -08:00
Jarek Kowalski	e22d22dba2	object: implemented fast concatenation of objects by merging their index entries (#607 )	2020-09-11 20:12:01 -07:00
Julio López	aebf8616a2	object: loadSeekTable helper (#608 )	2020-09-11 19:12:13 -07:00
Jarek Kowalski	9a6dea898b	Linter upgrade to v1.30.0 (#526 ) * fixed godot linter errors * reformatted source with gofumpt * disabled some linters * fixed nolintlint warnings * fixed gci warnings * lint: fixed 'nestif' warnings * lint: fixed 'exhaustive' warnings * lint: fixed 'gocritic' warnings * lint: fixed 'noctx' warnings * lint: fixed 'wsl' warnings * lint: fixed 'goerr113' warnings * lint: fixed 'gosec' warnings * lint: upgraded linter to 1.30.0 * lint: more 'exhaustive' warnings Co-authored-by: Nick <nick@kasten.io>	2020-08-12 19:28:53 -07:00
Jarek Kowalski	8687f1c008	object: added AsyncWrites to ObjectWriter, which improves performance… (#369 ) * object: added AsyncWrites to ObjectWriter, which improves performance of uploading of a single file Fixes #351 Co-Authored-By: Julio López <julio+gh@kasten.io>	2020-03-22 09:02:33 -07:00
Jarek Kowalski	239d809075	performance: introduced buf.Pool which helps reuse memory buffers (#345 ) * performance: added buf.Pool which can be used to manage ephemeral buffers for encryption and compression * repo: switched object writer to buf.Pool * content: switched encryption to use buf.Pool * object: switched compression to use buf.Pool * testing: added missing content manager Close()	2020-03-18 20:42:16 -07:00
Jarek Kowalski	8d452a8285	performance: improvements to object manager (#336 ) - added pooled splitters and ability to reset them without having to recreate - added support for caller-provided compressor output to be able to pool it - added pooling of compressor instances, since those are costly	2020-03-13 08:56:18 -07:00
Jarek Kowalski	d181403284	crypto: refactored encryption, hashing and splitter into separate packages (#274 ) Added some tests, deleted XSALSA20 which never worked E2E	2020-02-27 12:36:49 -08:00
Jarek Kowalski	ac70a38101	lint: upgraded to 1.22.2 and make lint issues a build failure fixed or silenced linter warnings, mostly due to magic numeric constants	2020-01-03 16:39:30 -08:00
Jarek Kowalski	2ba4e83cef	moved all compression to separate package and sanitized identifiers	2019-12-10 23:25:28 -08:00
Jarek Kowalski	aec3cdcb2f	object: added support for compressed objects	2019-12-10 23:25:28 -08:00
Jarek Kowalski	cfa2eea1e1	object: stop verifying object length in VerifyObject With compression, the decompressed object length is not known until we read the content.	2019-12-10 23:25:28 -08:00
Jarek Kowalski	6217df1a87	lint: switched to 1.21 and fixed a ton of whitespace issues discovered by new wsl linter	2019-11-26 06:49:49 -08:00
Jarek Kowalski	72520029b0	golangci-lint: added more linters Also fixed pre-existing lint errors.	2019-06-02 22:56:57 -07:00
Jarek Kowalski	54edb97b3a	refactoring: renamed repo/block to repo/content Also introduced strongly typed content.ID and manifest.ID (instead of string) This aligns identifiers across all layers of repository: blob.ID content.ID object.ID manifest.ID	2019-06-01 22:24:19 -07:00
Jarek Kowalski	9e5d0beccd	refactoring: renamed storage.Storage to blob.Storage This updates the terminology everywhere - blocks become blobs and `storage.Storage` becomes `blob.Storage`. Also introduced blob.ID which is a specialized string type, that's different from CABS block ID. Also renamed CLI subcommands from `kopia storage` to `kopia blob`. While at it introduced `block.ErrBlockNotFound` and `object.ErrObjectNotFound` that do not leak from lower layers.	2019-06-01 14:10:35 -07:00
Jarek Kowalski	1a7a02ddbe	cleanup imports by grouping all local imports together	2019-06-01 10:57:55 -07:00
Jarek Kowalski	63303904e1	switched remaining fmt.Errorf to errors.Wrap()	2019-06-01 10:57:05 -07:00
Jarek Kowalski	03339c18af	[breaking change] deprecated DYNAMIC splitter due to license issue The splitter in question was depending on github.com/silvasur/buzhash which is not licensed according to FOSSA bot Switched to new faster implementation of buzhash, which is unfortunately incompatible and will split the objects in different places. This change is be semi-breaking - old repositories can be read, but when uploading large objects they will be re-uploaded where previously they would be de-duped. Also added 'benchmark splitters' subcommand and moved 'block cryptobenchmark' subcommand to 'benchmark crypto'.	2019-05-30 22:20:45 -07:00
Jarek Kowalski	0c41d41276	Fixed up paths after merge	2019-05-27 15:48:39 -07:00
Jarek Kowalski	327d8317d8	refactored repo/ into separate github.com/kopia/repo/ git repository	2018-10-26 20:40:57 -07:00
Jarek Kowalski	dd1c0943cd	refactor: moved config file management to kopia/kopia/repo also fixed layering issue and removed dependency from 'object' on 'config'	2018-10-23 19:43:43 -07:00
Jarek Kowalski	1bbc169c0d	added missing package godoc, fixed test paths	2018-08-31 19:12:58 -07:00
Jarek Kowalski	91066f2469	reorganized low-level repository packages by moving them all under kopia/kopia/repo/	2018-08-30 22:01:05 -07:00

46 Commits