PDF Question Answering with Vector Store

This project implements a question-answering system for PDF documents using LangChain, OpenAI, and the FAISS vector store. It loads a PDF, splits it into chunks, creates embeddings for those chunks, and answers questions based on the document's content.

Features

PDF document loading and processing
Text splitting with overlap for better context preservation
Vector embeddings using OpenAI
In-memory and persistent vector storage using FAISS
Question-answering capabilities using LangChain and OpenAI
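
To illustrate what "text splitting with overlap" means, here is a minimal stdlib-only sketch. It is not the code in main.py (which presumably relies on a LangChain text splitter); the function name `split_with_overlap` and the default sizes are assumptions made for this example.

```python
from typing import List


def split_with_overlap(text: str, chunk_size: int = 1000, overlap: int = 200) -> List[str]:
    """Split text into character chunks where neighbours share `overlap` characters.

    Hypothetical helper for illustration only; sizes are assumed defaults.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks: List[str] = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window already reaches the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks


if __name__ == "__main__":
    for chunk in split_with_overlap("abcdefghij", chunk_size=4, overlap=2):
        print(chunk)
```

Because each chunk repeats the tail of the previous one, a sentence that straddles a chunk boundary still appears whole in at least one chunk, provided it is shorter than the overlap.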

Prerequisites

Python 3.8+
OpenAI API key

Installation

Clone the repository
Install dependencies using Pipenv:

pipenv install

Create a .env file in the project root and add your OpenAI API key:

OPENAI_API_KEY=your_api_key_here
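
As a rough sketch of how the key can be picked up at runtime, the snippet below reads `OPENAI_API_KEY` from the process environment and falls back to a `.env` file. It uses only the standard library; the actual project may well use a package such as python-dotenv instead, and the helper names `parse_env` and `load_api_key` are hypothetical.

```python
import os
from typing import Dict


def parse_env(text: str) -> Dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values: Dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def load_api_key(path: str = ".env") -> str:
    """Return OPENAI_API_KEY from the environment, else from the .env file."""
    key = os.environ.get("OPENAI_API_KEY")
    if not key:
        try:
            with open(path, encoding="utf-8") as fh:
                key = parse_env(fh.read()).get("OPENAI_API_KEY")
        except FileNotFoundError:
            key = None
    if not key:
        raise RuntimeError("OPENAI_API_KEY is not set; add it to .env")
    return key
```

Failing early with a clear error is usually friendlier than letting the first API call fail with an authentication error.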

Project Structure

main.py - Main application file containing the PDF processing and QA logic
faiss_index_react/ - Directory containing the saved FAISS vector store
react-paper.pdf - Sample PDF document for testing
.env - Environment variables file (not tracked in git)
Pipfile and Pipfile.lock - Python dependency management files

Usage

Activate the virtual environment:

pipenv shell

Run the main script:

python main.py

The script will:

Load the PDF document
Split it into manageable chunks
Create embeddings using OpenAI
Store the vectors in FAISS
Set up a question-answering chain using LangChain
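
The retrieval step behind the steps above can be pictured with a toy example: every chunk becomes a vector, and a question is answered from the chunks whose vectors lie closest to the question's vector. The sketch below is a brute-force stand-in for what FAISS does far more efficiently. The cosine metric, the hand-written 2-D vectors, and the `top_k` helper are all assumptions for illustration; real embeddings come from OpenAI and the index in main.py may use a different distance.

```python
import math
from typing import List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(query: Sequence[float],
          vectors: Sequence[Sequence[float]],
          k: int = 2) -> List[Tuple[int, float]]:
    """Return (index, score) pairs for the k vectors most similar to the query."""
    scored = [(i, cosine(query, v)) for i, v in enumerate(vectors)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


if __name__ == "__main__":
    # Toy "embeddings" for three chunks; real ones have many more dimensions.
    chunk_vectors = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    question_vector = [1.0, 0.0]
    for index, score in top_k(question_vector, chunk_vectors):
        print(index, round(score, 3))
```

The selected chunks are then passed to the language model as context alongside the question, which is the part the question-answering chain handles.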

Technologies Used

LangChain - Framework for developing applications powered by language models
OpenAI - Embeddings and the language model
FAISS - Efficient similarity search and clustering of dense vectors
PyPDF - PDF document processing

License

This project is open source and available under the MIT License.
