perf: reduce memory consumption across services by nledez · Pull Request #1047 · cgwire/zou

nledez · 2026-04-07T19:53:17Z

Problem
Several SQLAlchemy patterns cause excessive memory usage: Task.assignees is eagerly loaded via selectin on every query (70-80% don't need it), get_comments() triggers N+1 queries fetching persons one by one, backup_service loads all preview files into memory at once, and playlists_service fetches full ORM objects when only 2 fields are needed.

Solution

Fix N+1 in get_comments() by batch-fetching persons with get_persons_by_ids()
Stream preview files in backup_service with yield_per(500)
Use with_entities(id, extension) in playlists_service instead of full PreviewFile objects
Replace manual if key not in dict patterns with defaultdict in projects_service and time_spents_service
Flush index documents in batches in index_service instead of accumulating indefinitely
Change Task.assignees from lazy="selectin" to default lazy loading, add explicit selectinload only in the 3 bulk access sites (CSV export, schedule service, deletion service)

frankrousseau · 2026-04-08T07:01:22Z

zou/app/models/task.py

-    assignees = db.relationship(
-        "Person", secondary=TaskPersonLink.__table__, lazy="selectin"
-    )
+    assignees = db.relationship("Person", secondary=TaskPersonLink.__table__)


Please keep the selectin selection. It is more adapted to our case, where most consuming queries need it.

zou/app/services/time_spents_service.py

frankrousseau

See comments

- Fix N+1 queries in get_comments() by batch-fetching persons - Stream preview files in backup_service with yield_per(500) - Use with_entities for preview file lookup in playlists_service - Replace manual dict init patterns with defaultdict - Flush index documents in batches instead of accumulating - Change Task.assignees from eager selectin to lazy loading, add explicit selectinload only where assignees are accessed

Use yield_per(500) and Flask streaming response to avoid loading entire query results and building the full CSV string in memory.

frankrousseau reviewed Apr 8, 2026

View reviewed changes

zou/app/services/time_spents_service.py Show resolved Hide resolved

frankrousseau reviewed Apr 8, 2026

View reviewed changes

nledez added 2 commits April 14, 2026 14:32

perf(export): stream CSV responses instead of buffering in memory

0340b9f

Use yield_per(500) and Flask streaming response to avoid loading entire query results and building the full CSV string in memory.

nledez force-pushed the perf/reduce-memory-consumption branch from bb5f96b to 0340b9f Compare April 14, 2026 12:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: reduce memory consumption across services#1047

perf: reduce memory consumption across services#1047
nledez wants to merge 2 commits intocgwire:mainfrom
nledez:perf/reduce-memory-consumption

nledez commented Apr 7, 2026

Uh oh!

frankrousseau Apr 8, 2026

Uh oh!

Uh oh!

frankrousseau left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

nledez commented Apr 7, 2026

Uh oh!

frankrousseau Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

frankrousseau left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants