Move PCA and TSVD from cuml to raft by aamijar · Pull Request #2952 · rapidsai/raft

aamijar · 2026-02-13T09:09:54Z

This PR moves pca.cuh, tsvd.cuh, and gtests into raft.

cjnolet · 2026-02-13T16:50:32Z

cpp/include/raft/linalg/pca.cuh

+
+template <typename math_t, typename enum_solver = solver>
+void truncCompExpVars(const raft::handle_t& handle,
+                      math_t* in,


This needs to use mdspan; we've deprecated all the pointer APIs.
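To illustrate the difference this comment points at, here is a minimal host-only sketch. The `matrix_view` and `vector_view` structs and both `column_means` functions are hypothetical stand-ins written for this example; they are not RAFT's mdspan types or any function in this PR. The idea is only that a view carries its shape with the data, so the signature no longer needs loose pointer and size arguments.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical non-owning 2-D view (row-major); a stand-in, not a RAFT type.
template <typename T>
struct matrix_view {
  T* data;
  std::size_t n_rows;
  std::size_t n_cols;
  T& operator()(std::size_t r, std::size_t c) const { return data[r * n_cols + c]; }
};

// Hypothetical non-owning 1-D view; a stand-in, not a RAFT type.
template <typename T>
struct vector_view {
  T* data;
  std::size_t size;
  T& operator()(std::size_t i) const { return data[i]; }
};

// Pointer-style signature: the shape travels separately from the data,
// so nothing ties n_rows and n_cols to the buffers that were passed.
template <typename T>
void column_means_ptr(const T* in, std::size_t n_rows, std::size_t n_cols, T* out)
{
  for (std::size_t c = 0; c < n_cols; ++c) {
    T sum = T(0);
    for (std::size_t r = 0; r < n_rows; ++r) { sum += in[r * n_cols + c]; }
    out[c] = sum / static_cast<T>(n_rows);
  }
}

// View-style signature: each argument carries its own extents,
// so shape mismatches can be checked at the API boundary.
template <typename T>
void column_means(matrix_view<const T> in, vector_view<T> out)
{
  assert(out.size == in.n_cols);
  for (std::size_t c = 0; c < in.n_cols; ++c) {
    T sum = T(0);
    for (std::size_t r = 0; r < in.n_rows; ++r) { sum += in(r, c); }
    out(c) = sum / static_cast<T>(in.n_rows);
  }
}
```

A call would look like `column_means(matrix_view<const double>{in.data(), 2, 3}, vector_view<double>{out.data(), out.size()})`, where `in` and `out` are `std::vector<double>` buffers.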

cjnolet · 2026-02-13T16:51:20Z

cpp/include/raft/linalg/pca.cuh

+            math_t* input,
+            math_t* components,
+            math_t* explained_var,
+            math_t* explained_var_ratio,


The input order should match the other (newer) APIs: handle, params, input, output, free params. Also, the stream is in the handle now, and we use device_resources, not raft::handle_t.
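As a rough host-only sketch of the requested ordering (handle, params, input, output, free params), the example below computes explained variance from singular values. Everything in it is an assumption made for illustration: the `resources` and `pca_params` structs, `get_stream`, and `explained_variance` are hypothetical and are not RAFT's API or the functions in this PR. It also assumes `singular_vals` holds all singular values of the centered data, so that the variance of component i is s_i^2 / (n_rows - 1).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a resources handle. The stream lives in the
// handle, so it is not passed as a separate argument.
struct resources {
  int stream_id = 0;
};

inline int get_stream(const resources& res) { return res.stream_id; }

// Hypothetical parameter struct for illustration.
struct pca_params {
  std::size_t n_components = 1;
};

// Argument order: handle, params, input, outputs, free parameters.
inline void explained_variance(const resources& res,
                               const pca_params& params,
                               const std::vector<double>& singular_vals,  // input
                               std::vector<double>& explained_var,        // output
                               std::vector<double>& explained_var_ratio,  // output
                               std::size_t n_rows)                        // free parameter
{
  assert(n_rows > 1);
  assert(params.n_components <= singular_vals.size());

  // A GPU version would launch its work on the stream taken from the handle.
  const int stream = get_stream(res);
  (void)stream;

  // Total variance over all components: sum of s_i^2 / (n_rows - 1).
  const double denom = static_cast<double>(n_rows - 1);
  double total = 0.0;
  for (double s : singular_vals) { total += s * s / denom; }

  // Keep only the first n_components entries in the outputs.
  explained_var.resize(params.n_components);
  explained_var_ratio.resize(params.n_components);
  for (std::size_t i = 0; i < params.n_components; ++i) {
    explained_var[i]       = singular_vals[i] * singular_vals[i] / denom;
    explained_var_ratio[i] = explained_var[i] / total;
  }
}
```

With this ordering a caller reads the signature left to right as "context, configuration, what goes in, what comes out, extras", which is the convention the comment asks the moved functions to follow.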

jinsolp

Thanks @aamijar! Just a minor comment.
Question: will this be imported in cuvs and exposed as a python API?

jinsolp · 2026-02-14T00:23:20Z

cpp/include/raft/linalg/detail/pca.cuh

+/**
+ * @brief perform fit operation for the pca. Generates eigenvectors, explained vars, singular vals,
+ * etc.
+ * @param[in] handle: cuml handle object


The doc mentions a cuml handle, not raft device_resources! (Same for the other docs too.)

aamijar · 2026-02-14T00:35:49Z

Question: will this be imported in cuvs and exposed as a python API?

We will still have the same python and cpp apis in cuml too!
On the cuvs side I think the plan is to expose a cpp api.

cjnolet · 2026-02-14T01:14:25Z

@aamijar we will probably expose a preprocessing API through Python for users who need to write scripts (for example, Jinsol's new dataset gen requires PCA, and it would be a circular dependency if we included cuml in cuVS) or who have databases written in Python.

But, like I mentioned to Simon, the users are very different between the two. Same thing with kmeans: kmeans clustering is the equivalent of "lexicographic ordering" in the vector world. PCA is another way to reduce the footprint of vectors without losing quality.

Data science users will continue to use cuml. Vector databases will continue to use cuVS. It's important we don't duplicate code across the two... and since cuml is already using cuVS, it can continue to use the c++ api like you mentioned.

move-pca-from-cuml

afd395d

aamijar requested review from a team as code owners February 13, 2026 09:09

github-project-automation bot added this to Vector Search, ML, & Data Mining Release Board Feb 13, 2026

aamijar self-assigned this Feb 13, 2026

aamijar added non-breaking Non-breaking change feature request New feature or request labels Feb 13, 2026

Merge branch 'main' into move-pca-from-cuml

8d80e80

aamijar moved this to In Progress in Vector Search, ML, & Data Mining Release Board Feb 13, 2026

cjnolet reviewed Feb 13, 2026

jinsolp reviewed Feb 14, 2026

aamijar added 5 commits February 14, 2026 01:50

mdspan public api

7289840

remove default template type

0c86857

update docstring

9e285e6

remove fixme comment

529514e

expose more tsvd functions

e9f6e2c

aamijar wants to merge 7 commits into rapidsai:main from aamijar:move-pca-from-cuml

3 participants
