Implement Indexer and Retriever functionalities for given environment clearance data #2

Description

@KaranrajM

Devise a suitable chunking strategy for indexing the given environment clearance data, where each file contains a list of projects and their details. Additionally, implement a retriever that can perform the following actions:

  • Search and look up a project by location (latitude and longitude, or state and city)
  • Query for specific details within a single project only
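
As a rough illustration of the location lookup, here is a minimal sketch in Python. It assumes each project is a dict with `project_id`, `state`, `city`, `latitude` and `longitude` keys; these field names, the `find_projects` helper, the 25 km default radius and the sample records are all hypothetical, since the issue does not specify the data schema. Coordinate search uses the haversine great-circle distance; state and city search is a case-insensitive exact match.

```python
import math
from typing import Dict, List, Optional

EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance between two (lat, lon) points in kilometres."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def find_projects(
    projects: List[Dict],
    *,
    lat: Optional[float] = None,
    lon: Optional[float] = None,
    radius_km: float = 25.0,  # assumed default, not taken from the issue
    state: Optional[str] = None,
    city: Optional[str] = None,
) -> List[Dict]:
    """Return projects near (lat, lon), or projects in the given state and city."""
    if lat is not None and lon is not None:
        hits = []
        for p in projects:
            # Skip records without coordinates instead of failing on them.
            if p.get("latitude") is None or p.get("longitude") is None:
                continue
            d = haversine_km(lat, lon, p["latitude"], p["longitude"])
            if d <= radius_km:
                hits.append((d, p))
        hits.sort(key=lambda h: h[0])  # nearest first
        return [p for _, p in hits]
    if state is not None and city is not None:
        s, c = state.strip().lower(), city.strip().lower()
        return [
            p for p in projects
            if str(p.get("state", "")).strip().lower() == s
            and str(p.get("city", "")).strip().lower() == c
        ]
    raise ValueError("Provide either lat and lon, or state and city")


# Invented sample records, for illustration only.
SAMPLE_PROJECTS = [
    {"project_id": "P-001", "state": "Maharashtra", "city": "Pune",
     "latitude": 18.52, "longitude": 73.86},
    {"project_id": "P-002", "state": "Maharashtra", "city": "Mumbai",
     "latitude": 19.08, "longitude": 72.88},
    {"project_id": "P-003", "state": "Tamil Nadu", "city": "Chennai",
     "latitude": 13.08, "longitude": 80.27},
]

nearby = find_projects(SAMPLE_PROJECTS, lat=18.5, lon=73.9, radius_km=50)
in_city = find_projects(SAMPLE_PROJECTS, state="Maharashtra", city="Mumbai")
```

A linear scan like this is likely fine for a few thousand projects; a larger dataset would probably want a spatial index or a vector store with metadata filters instead.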

Goal

To develop an information retrieval system specific to environment clearance data.

Expected Outcome

  • Efficient indexer and retriever services that enable users to:
    • Look up and search for a list of projects in a particular location
    • Query specific details within a project
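
One way to keep a query confined to a single project is to filter chunks by a project identifier before ranking them. The sketch below assumes each indexed chunk is a dict with hypothetical `project_id` and `text` keys, and it ranks by simple token overlap purely as a stand-in so the example runs without an API key; a real implementation would presumably rank by embedding similarity instead.

```python
import re
from typing import Dict, List, Set


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and split it into a set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def query_within_project(
    chunks: List[Dict], project_id: str, question: str, top_k: int = 3
) -> List[Dict]:
    """Rank only the chunks that belong to `project_id` against the question."""
    question_tokens = tokenize(question)
    # Scope first: chunks from other projects are never scored.
    scoped = [c for c in chunks if c["project_id"] == project_id]
    scored = [(len(question_tokens & tokenize(c["text"])), c) for c in scoped]
    scored = [(score, c) for score, c in scored if score > 0]
    scored.sort(key=lambda sc: sc[0], reverse=True)  # stable: ties keep input order
    return [c for _, c in scored[:top_k]]


# Invented chunks, for illustration only.
SAMPLE_CHUNKS = [
    {"project_id": "P-001", "text": "Forest land diverted: 12 hectares"},
    {"project_id": "P-001", "text": "Clearance granted after public hearing"},
    {"project_id": "P-002", "text": "Forest land diverted: 40 hectares of forest land"},
]

results = query_within_project(SAMPLE_CHUNKS, "P-001", "How much forest land is diverted?")
```

Filtering before scoring means a stronger match in another project can never displace results from the project being queried.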

Acceptance Criteria

An information retrieval system for environment clearance data that returns the correct projects for a given location and the correct details within a project, with high accuracy.

Implementation Details

  1. Implement a suitable and efficient chunking technique for the given dataset.
  2. Build a retriever that can:
    • Search and look up a project using location details (latitude and longitude, or state and city).
    • Search for specific details within a single project only.
  3. Sample parsed and cleaned environment clearance (EC) data can be found here. The corresponding raw data can be found here.
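
Since each file holds a list of projects, one plausible chunking approach is to split every project into per-field chunks and copy the project's identifier and location onto each chunk as metadata, so that both retriever actions can filter on it. The sketch below is an assumption rather than a prescribed design: the field names, the `chunk_project` helper and the 500-character limit are hypothetical, and the actual EC data schema may differ.

```python
from typing import Dict, List

# Hypothetical fields treated as metadata rather than as chunk text.
METADATA_FIELDS = ("project_id", "state", "city", "latitude", "longitude")


def chunk_project(project: Dict, max_chars: int = 500) -> List[Dict]:
    """Split one project record into chunks that each carry its metadata."""
    metadata = {key: project.get(key) for key in METADATA_FIELDS}
    chunks = []
    for field, value in project.items():
        if field in METADATA_FIELDS or value is None or value == "":
            continue
        text = f"{field}: {value}"
        # Long fields are cut into fixed-size windows of at most max_chars.
        for start in range(0, len(text), max_chars):
            chunks.append({**metadata, "field": field, "text": text[start:start + max_chars]})
    return chunks


def chunk_file(projects: List[Dict], max_chars: int = 500) -> List[Dict]:
    """Chunk every project in a parsed file; chunks never span two projects."""
    return [chunk for project in projects for chunk in chunk_project(project, max_chars)]


# Invented record, for illustration only.
sample_project = {
    "project_id": "P-001",
    "state": "Maharashtra",
    "city": "Pune",
    "latitude": 18.52,
    "longitude": 73.86,
    "project_name": "Example road widening",
    "conditions": "x" * 1200,
}

chunks = chunk_file([sample_project])
```

Keeping chunk boundaries inside a single project avoids mixing details from neighbouring projects in one embedding, which is the main risk of naive fixed-size splitting on list-structured files.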

Mockups/Wireframes

NOT APPLICABLE

Product Name

Jugalbandi

Organisation Name

OpenNyAI

Domain

Legal

Tech Skills Needed

  • Python

Requisites

  • OpenAI API key for indexing and retrieval-augmented generation (RAG)

Complexity

Medium

Category

Backend
