Skip to content

Use h5py for output data writing and consolidation to reduce memory footprint#10

Closed
thomas-a-neil wants to merge 2 commits intomasterfrom
change-output-to-h5
Closed

Use h5py for output data writing and consolidation to reduce memory footprint#10
thomas-a-neil wants to merge 2 commits intomasterfrom
change-output-to-h5

Conversation

@thomas-a-neil
Copy link
Copy Markdown
Member

Building on CannyLab/rinokeras#12, the data consolidation step will read the entire output dataset into memory (which will crash for relatively small datasets if we include all encoder outputs, especially for the LSTM).

hdf5 allows us to iteratively write, and avoid the memory overhead of pickle

Upon reflection, some documentation update should probably be done as well, because I think we reference pickle a few time

@thomas-a-neil thomas-a-neil requested a review from rmrao August 8, 2019 23:49
@thomas-a-neil
Copy link
Copy Markdown
Member Author

This should also help with songlab-cal/tape#8

@rmrao
Copy link
Copy Markdown
Collaborator

rmrao commented Aug 25, 2019

Should we merge this? I don't think the rinokeras changes have been merged to master?

@thomas-a-neil
Copy link
Copy Markdown
Member Author

It depends on rinokeras changes, so I don't think we can merge it yet.

@rmrao
Copy link
Copy Markdown
Collaborator

rmrao commented Jan 7, 2020

Closing since both this and rinokeras are in basic maintenance mode now, so no major changes will be made.

@rmrao rmrao closed this Jan 7, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants