Skip to content

BUG: Same image gets indexed multiple times when directory contains symlinks/hardlinks #1181

@roushnisareen

Description

@roushnisareen

Is there an existing issue for this?

  • I have searched the existing issues

What happened?

Describe the bug

When a library includes symbolic links (or hardlinks) to photos, PictoPy appears to index the same underlying image multiple times instead of treating it as one media item.

Steps to reproduce

  1. Create folder A with 20 images.
  2. Create folder B containing symlinks to the same 20 images from A (or hardlinks).
  3. Add both folders in PictoPy settings.
  4. Start scan/indexing.

Expected behavior

Each physical image should be indexed once (deduplicated by inode/path-resolve/hash strategy), with a single media record in the UI.

Actual behavior

Duplicate entries appear for the same image content, and indexing/AI tagging runs repeatedly for linked copies.

Impact

  • Slower scans and higher CPU usage
  • Duplicate thumbnails/search results
  • Larger metadata DB than necessary

Suggested fix

  • Normalize file identity before insert (resolve symlink + stable file identity check).
  • Add a unique constraint / duplicate guard in ingestion pipeline.
  • Skip AI pipeline when file identity already exists.

Record

  • I agree to follow this project's Code of Conduct

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions