Feature proposal: unsupervised manifold learning to understand un-labelled datasets ?

nrtk currently helps to evaluate the robustness of existing models or algorithms. The goal would be to expend its functionality to enable insights from datasets alone.
Using unsupervised manifold learning, one can automatically extract insights from an un-annotated dataset by projecting the n-dimensionnal latent space to a 3D/2D reduce space (with PCA, UMAP or T-SNE as you are already doing).

The additionnal features for nrtk would be:
1. Select an "embedding model" such as a convolutionnal VAE, or classical non-reference image quality like [opencv BRISQUE](https://docs.opencv.org/4.x/d8/d99/classcv_1_1quality_1_1QualityBRISQUE.html).
2. Train the model if required (for ML-based techniques like CVAE)
3. Infer the model on an un-labelled dataset (optionally with augmentation) (_not sure  if nrtk support no annotations_)
4. Visualize the features on the new space

We are currently working on this topic with ifremer, you can find a first POC [here](https://gitlab.kitware.com/keu-computervision/ifremer/obscame-dataset-quality). The current visualization is really basic with a static file (html) exported from bokeh.

Let us know what you think,

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature proposal: unsupervised manifold learning to understand un-labelled datasets ? #6

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Feature proposal: unsupervised manifold learning to understand un-labelled datasets ? #6

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions