Add sparse file support #35

zwhitchcox · 2024-12-07T18:53:03Z

Add support for sparse files using the old GNU format.

This is the current default format for sparse files in archives generated by GNU tar. Extended headers are supported as well.

Also supports a raw mode, which reads the sparse map and emits only the chunks that actually contain data, as described by that map. This is useful for saving memory while streaming a sparse file.
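To make the difference between the two modes concrete, here is a minimal sketch of how a sparse map relates the raw archive data to the full file. The `SparseSegment` shape and the `expandSparse` helper are hypothetical names used for illustration only; they are not taken from this PR or from the tar-parser API.

```typescript
// Hypothetical shape of one sparse-map entry (an assumption, not the PR's type).
interface SparseSegment {
  offset: number; // where this data belongs in the full (expanded) file
  size: number; // how many data bytes the archive stores for this segment
}

// Expand raw, hole-free archive data into the full file contents.
// Holes are left as zeros because Uint8Array is zero-initialized.
function expandSparse(
  raw: Uint8Array,
  segments: SparseSegment[],
  realSize: number,
): Uint8Array {
  const out = new Uint8Array(realSize);
  let cursor = 0; // read position within the raw data
  for (const { offset, size } of segments) {
    out.set(raw.subarray(cursor, cursor + size), offset);
    cursor += size;
  }
  return out;
}

// Five stored bytes describe a 12-byte file with holes around them.
const rawData = new Uint8Array([1, 2, 3, 4, 5]);
const sparseMap: SparseSegment[] = [
  { offset: 2, size: 2 },
  { offset: 8, size: 3 },
];
const expanded = expandSparse(rawData, sparseMap, 12);
```

In this sketch the default mode corresponds to consuming `expanded`, while raw mode corresponds to consuming `rawData` together with `sparseMap`, so the zero-filled regions never have to be held in memory.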

* add test cases for multiple extended headers
* fix bug where 80+ extended headers would crash tar

mjackson

Thanks for the PR, @zwhitchcox! I'd like to see a few small changes before merging.

mjackson · 2024-12-18T00:49:32Z

packages/tar-parser/README.md

+});
+```
+
+If you prefer the raw data chunks as they appear in the archive (without reconstructing zeros), you can call `entry.bytes({ raw: true })`:


When you say "without reconstructing zeros" do you mean that the resulting byte array will have zeroes in it? That's just spacer data, right? Forgive my ignorance, but when would that ever be useful?

This is the default behavior of tar.

Basically, if someone sends you a sparse tarball, you don't have to do anything differently if you just pipe to a write stream for the file. So, for someone with no knowledge of sparse files, it "just works".

However, if you're a more advanced user, you can get the sparse offsets and lengths, and use fs.write(fd, buffer, offset, length), which will be more efficient, because you're not writing the extra data.
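As a rough sketch of that advanced path, the snippet below writes only the data chunks at their target positions and lets the file system keep the gaps as holes. The `SparseChunk` shape and `writeSparseChunks` helper are hypothetical and not part of this PR. One detail worth noting: in Node's `fs.write` and `fs.writeSync`, the `offset` and `length` arguments index into the buffer, and the position in the file is the separate fifth `position` argument.

```typescript
import {
  closeSync,
  ftruncateSync,
  mkdtempSync,
  openSync,
  readFileSync,
  writeSync,
} from "node:fs";
import { tmpdir } from "node:os";
import { join } from "node:path";

// Hypothetical pairing of a data chunk with its position in the full file.
interface SparseChunk {
  offset: number; // position in the output file
  data: Uint8Array; // bytes actually stored in the archive
}

// Write only the data-bearing chunks; gaps between them are never written.
function writeSparseChunks(
  path: string,
  chunks: SparseChunk[],
  realSize: number,
): void {
  const fd = openSync(path, "w");
  try {
    for (const { offset, data } of chunks) {
      // writeSync(fd, buffer, bufferOffset, length, filePosition)
      writeSync(fd, data, 0, data.length, offset);
    }
    // Extend the file to its full size so a trailing hole is preserved.
    ftruncateSync(fd, realSize);
  } finally {
    closeSync(fd);
  }
}

// Usage: a 10-byte file whose only stored data is two bytes at position 4.
const dir = mkdtempSync(join(tmpdir(), "sparse-"));
const target = join(dir, "out.bin");
writeSparseChunks(target, [{ offset: 4, data: new Uint8Array([7, 8]) }], 10);
const written = readFileSync(target);
```

Whether the skipped regions end up as real holes on disk depends on the file system; reading the file back returns zeros for them either way.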

zwhitchcox added 3 commits December 7, 2024 13:49

add sparse test file

ebca3e3

support sparse raw mode

b0a4bcd

add docs for sparse file

b012cc8

zwhitchcox force-pushed the main branch from f72a79c to b012cc8 on December 7, 2024 19:12

zwhitchcox added 2 commits December 8, 2024 18:26

Multiple extended headers:

e2007cb

* add test cases for multiple extended headers
* fix bug where 80+ extended headers would crash tar

convert extended gnu type character representations to numbers

cbd811a

mjackson reviewed Dec 18, 2024
