This repository was archived by the owner on Jul 18, 2024. It is now read-only.

This repository was archived by the owner on Jul 18, 2024. It is now read-only.

Refactor doc_loader.py to load documents concurrently using Ray actors or Spark tasks, instead of loading them all at once and then putting them into a dataset #510

Open

opened

on Dec 26, 2023

Refactor doc_loader.py to load documents concurrently using Ray actors or Spark tasks, instead of loading them all at once and then putting them into a dataset

Metadata

Assignees

No one assigned

Labels

No labels

No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests