-
Notifications
You must be signed in to change notification settings - Fork 19
Open
Description
Hello! Thanks for pyorc; using it has been a pleasure so far, with the exception that we seem to be running into memory issues. I think Writer is leaking memory? Our workload is roughly:
- Open ~100 writers to different files
- Iterate over our input rows (in the millions) and send each row to exactly one writer
- Close all writers
- Repeat
Memory usage will grow without bound between iterations. This, coupled with the fact that lowering the stripe size all the way down to 1M has no effect, makes me suspect a memory leak. Below is a script that will reproduce -- around iteration 10 it gets to 20G and then killed by the OOM killer on my machine. Let me know if there's anything I can do to help track it down!
https://gist.github.com/JohnEmhoff/274f6e05cba3f17a16683eb394bfe6b5
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels