feat: send files directly from bytes by EdJoPaTo · Pull Request #260 · ayrat555/frankenstein

EdJoPaTo · 2025-02-21T00:20:51Z

No description provided.

less confusing to read

EdJoPaTo · 2025-02-21T00:30:41Z

src/client_ureq.rs

While this PR seems awesome at first glance it turns out as quite a performance mess with bigger files. The params are parsed with serde_json which involves parsing megabytes of data (for InputFile::bytes) only to throw them away as they are handled via the files Vec rather than the params.

Possible ideas: on every method that sends files the InputFile needs to be replaced with an (empty?) String to ensure there isn't much data touched for the params. As that are references currently, it will require a clone, so the InputFile will be at least twice in memory. With self-hosted bot API servers that can be quite a lot of memory usage.

Alternatively, the params of API methods which handle files need lifetimes and only a reference to the actual data. Then the clones are way cheaper. Also, the params could be moved rather than referenced into the methods which allows for modifying them instead of cloning. If the developer using this library requires the params multiple times, they can still clone it themselves which would still be pretty cheap.
I tinkered with std::borrow::Cow but that doesn't change the fact that it requires the same: lifetimes for all the Param structs. ('static is pretty useless in this case as most stuff would require Cow::Owned then again → nothing won.)

An alternative might be proc macros knowing which variable in a struct is what. So InputFile wouldn't end up in the params in the first place. But I think this is another rabbit hole and which some more complex file handling methods, probably very messy. → probably not a good idea.

Quick Idea: Does skip serializing work with a certain enum type? (Option::None works) Then bytes could be skipped to be serialized?

Conflicts: examples/async_file_upload.rs src/client_reqwest.rs src/client_ureq.rs

Conflicts: src/trait_async.rs src/trait_sync.rs

removing that is part of another PR

Conflicts: src/api_params.rs src/client_ureq.rs src/trait_async.rs src/trait_sync.rs

Conflicts: src/client_ureq.rs src/macros.rs src/trait_async.rs src/trait_sync.rs

EdJoPaTo · 2025-03-11T00:32:35Z

src/client_reqwest.rs

This bytes.clone() is kinda horrible. Even with the rest of this approach being refactored into references, reqwest still requires a clone here.
And for 1.5 GB file data, this is definitely a lot.

The best we could do would require the ownership and pass it over to reqwest so it's never cloned by frankenstein and only explicitly by the user of frankenstein when needed.

ureq would work with references while reqwest doesn't. But frankenstein requires either both with references or none.

When someone wants to upload a file the FileUpload type is required. They can read about it in the autogenerated docs themselves. No need to manually add that to the README again.

EdJoPaTo · 2025-03-13T18:03:50Z

The current state is definitely useable but kinda annoying due to the clones and the implicit serialization as parameters. Its not a big deal with PathBuf but with binary data it has at least 2 instances of it in memory, some methods even more. Sending some MB is totally fine but sending 1.5 GB files is not.

I am thinking about putting the binary stuff behind a feature flag. That way it's useable but not an easy foot gun on bigger files.

The alternative is to take the ownership of all parameters instead of taking their references. That's breaking but would prevent such huge data duplications in memory. And when it should be duplicated, the user actively has to call clone themselves.

A lite step of that would be to take only the ownership for methods that involve files. But its probably a weird interface then as it differs between methods.

Not sure which is the better way forward with this PR.

Conflicts: examples/api_trait_implementation.rs examples/file_upload.rs

They might not be perfectly accurate now (might not accept URLs or file_id) but they are far easier to handle with macros. Fixes setWebhook which never correctly uploaded certificate.

EdJoPaTo added 3 commits February 20, 2025 19:11

feat: send files directly from bytes

a5bd40f

refactor: less error-prone new_params without InputFile

bc10f7c

refactor: move internal method to the end of the impl

d56bf46

less confusing to read

EdJoPaTo mentioned this pull request Feb 21, 2025

refactor(client-ureq): remove mime_guess #261

Closed

EdJoPaTo commented Feb 21, 2025

View reviewed changes

EdJoPaTo added 7 commits February 21, 2025 13:36

Merge branch 'master' into edjopato/filehandling

1550453

Conflicts: examples/async_file_upload.rs src/client_reqwest.rs src/client_ureq.rs

Merge branch 'master' into edjopato/filehandling

44bcac6

Conflicts: src/trait_async.rs src/trait_sync.rs

fix(client-ureq): add back mime_guess

3feeaa6

removing that is part of another PR

Merge remote-tracking branch 'origin/master' into edjopato/filehandling

cb87c98

Conflicts: src/api_params.rs src/client_ureq.rs src/trait_async.rs src/trait_sync.rs

test(client-ureq): use include_bytes for more realistic dummyfile

64625c3

fix(macros): InputFile no longer has impl From

c94e76c

Merge branch 'master' into edjopato/filehandling

7ccfa89

Conflicts: src/client_ureq.rs src/macros.rs src/trait_async.rs src/trait_sync.rs

EdJoPaTo linked an issue Mar 5, 2025 that may be closed by this pull request

send file using bytes #183

Open

EdJoPaTo added 2 commits March 10, 2025 22:59

Merge remote-tracking branch 'origin/master' into edjopato/filehandling

a4e50b7

refactor: use simpler io::Error::other

c5629e0

EdJoPaTo commented Mar 11, 2025

View reviewed changes

perf(files): keep PathBuf variant as its optimized by multistream

1ff7f49

EdJoPaTo force-pushed the edjopato/filehandling branch from 95ecdc8 to 1ff7f49 Compare March 11, 2025 01:45

docs(readme): remove relatively clear upload docs

a168c9b

When someone wants to upload a file the FileUpload type is required. They can read about it in the autogenerated docs themselves. No need to manually add that to the README again.

Merge remote-tracking branch 'origin/master' into edjopato/filehandling

02a1936

EdJoPaTo mentioned this pull request Mar 24, 2025

send file using bytes #183

Open

EdJoPaTo added 6 commits April 8, 2025 13:06

Merge remote-tracking branch 'origin/master' into edjopato/filehandling

dc2b8d7

Conflicts: examples/api_trait_implementation.rs examples/file_upload.rs

refactor: generalize methods with file uploads

ac2aaf2

They might not be perfectly accurate now (might not accept URLs or file_id) but they are far easier to handle with macros. Fixes setWebhook which never correctly uploaded certificate.

docs(traits): add docs for the manually implemented methods

414c573

feat(files): move instead of clone

80f83ec

fixup! feat(files): move instead of clone

e616f3b

perf: parse directly into map

6676ee3

EdJoPaTo mentioned this pull request Aug 13, 2025

Add possiblity to provide file as Vec<u8> for upload, not just filesystem path #296

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: send files directly from bytes#260

feat: send files directly from bytes#260
EdJoPaTo wants to merge 21 commits intomasterfrom
edjopato/filehandling

EdJoPaTo commented Feb 21, 2025

Uh oh!

EdJoPaTo Feb 21, 2025

Uh oh!

EdJoPaTo Aug 13, 2025

Uh oh!

EdJoPaTo Mar 11, 2025

Uh oh!

EdJoPaTo Mar 11, 2025

Uh oh!

EdJoPaTo commented Mar 13, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

EdJoPaTo commented Feb 21, 2025

Uh oh!

EdJoPaTo Feb 21, 2025

Choose a reason for hiding this comment

Uh oh!

EdJoPaTo Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

EdJoPaTo Mar 11, 2025

Choose a reason for hiding this comment

Uh oh!

EdJoPaTo Mar 11, 2025

Choose a reason for hiding this comment

Uh oh!

EdJoPaTo commented Mar 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

EdJoPaTo commented Mar 13, 2025 •

edited

Loading