Determine the total buffer size before starting to read LH5 data from a list of files #93

gipert · 2024-05-11T09:43:06Z

Should we avoid resizing buffers every time a new file is read and appended, and instead allocate a single buffer of the right total size up front? Would this approach use less memory and be faster?
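To make the question concrete, here is a minimal sketch of the two strategies. It uses in-memory NumPy arrays as stand-ins for the data of each file; the helper names `read_append` and `read_preallocated` are hypothetical and not part of the LH5 API.

```python
import numpy as np


def read_append(chunks):
    """Grow the buffer every time a new chunk (file) is appended."""
    buf = np.empty(0, dtype=chunks[0].dtype)
    for chunk in chunks:
        # np.concatenate allocates a new array and copies all data read so far
        buf = np.concatenate([buf, chunk])
    return buf


def read_preallocated(chunks):
    """Compute the total size first, allocate once, then fill in place."""
    total = sum(len(c) for c in chunks)
    buf = np.empty(total, dtype=chunks[0].dtype)
    start = 0
    for chunk in chunks:
        stop = start + len(chunk)
        buf[start:stop] = chunk  # single copy per chunk, no reallocation
        start = stop
    return buf


# Stand-ins for the per-file data (assumption: 1-D arrays of equal dtype)
chunks = [np.arange(3), np.arange(3, 7), np.arange(7, 12)]
assert np.array_equal(read_append(chunks), read_preallocated(chunks))
```

The append variant copies the accumulated data once per file, while the preallocated variant copies each chunk exactly once. The open point is the cost of obtaining `total` when the chunks live in files rather than in memory.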

iguinn · 2024-07-20T20:41:36Z

I think this would help with both memory and speed, but I'm not sure how we get the total size first without slowing things down (at least with core.read and store.read). I think this is a great idea for the data loader, as we could store sizes in the file DB.

Another option would be to allow the memory buffers for LGDO objects to have a capacity larger than their current size. Then, instead of increasing the size every time we add a new file, we could, say, double the capacity whenever the buffer is full. In other words, we could make our arrays behave like C++ vectors.
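A rough sketch of what separating size from capacity could look like, assuming a plain 1-D NumPy buffer. The class `GrowableBuffer` and its methods are hypothetical illustrations, not the LGDO implementation.

```python
import numpy as np


class GrowableBuffer:
    """Hypothetical 1-D buffer whose capacity may exceed its logical size."""

    def __init__(self, dtype=np.float64, capacity=16):
        self._data = np.empty(max(1, capacity), dtype=dtype)
        self._size = 0  # number of valid elements

    @property
    def capacity(self):
        return len(self._data)

    def __len__(self):
        return self._size

    def append(self, values):
        values = np.asarray(values, dtype=self._data.dtype)
        needed = self._size + len(values)
        if needed > self.capacity:
            # Double the capacity until the new data fits
            new_cap = self.capacity
            while new_cap < needed:
                new_cap *= 2
            new_data = np.empty(new_cap, dtype=self._data.dtype)
            new_data[: self._size] = self._data[: self._size]
            self._data = new_data
        self._data[self._size : needed] = values
        self._size = needed

    def view(self):
        """Return a view of the valid elements only (no copy)."""
        return self._data[: self._size]


buf = GrowableBuffer(dtype=np.int64, capacity=4)
buf.append(np.arange(3))      # fits in the initial capacity
buf.append(np.arange(3, 10))  # triggers a reallocation
assert len(buf) == 10 and buf.capacity >= 10
```

With doubling, the number of reallocations grows only logarithmically with the final size, at the price of holding up to roughly twice the memory actually needed.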

gipert added the performance (Code performance) and lh5 (HDF5 I/O) labels May 11, 2024

iguinn mentioned this issue Sep 19, 2024

Separate size and capacity of arrays in LGDO objects #107

Open

gipert linked a pull request Nov 9, 2024 that will close this issue

Separate size and capacity of LGDO ArrayLike objects #109

Open
