Easiest Way to Use Agentic RAG in Any Enterprise - RAGapp - Install Locally
AI Nuggets
Instructions for Setting Up RAG Locally
Prerequisites:
- Operating System: Ubuntu 22.4
- GPU: At least 16 GB of VRAM recommended for AMA models (22 GB used in the video)
- Docker: Ensure Docker is installed (version 26.1.3 mentioned)
- Conda: Recommended for creating a virtual environment
Steps:
- Create a Conda Virtual Environment:
conda create -n <environment_name>
- Press
y
to confirm.- Activate the virtual environment:
conda activate <environment_name>
- Clone the GitHub Repository:
- Clone the repository (URL provided in the video description):
git clone <repository_url>
- Navigate to the cloned directory:
cd <cloned_directory>
- Run the Docker Command:
- Execute the Docker command to run the rag app:
docker run -p 8000:8000 <docker_image_name>
- Ignore any warnings if the image is already downloaded.
- Access the Application:
- Open the application by navigating to
localhost:8000
in your web browser.- Configure the Application:
- Select the desired AI provider (OpenAI, GPT-J, or AMA).
- Enter your API key from the providerās platform (e.g.,
platform.openai.com
for OpenAI).- Click on āUpdateā to save the configuration.
- Upload Your Data:
- Provide a prompt or context for the model.
- Upload documents or files that contain the data you want the model to use.
- Optionally, you can use tools to fetch data from Wikipedia, DuckDuckGo, or other websites.
- Test the Application:
- Enter a query and observe the retrieved context and generated response.
- The application will display the sources used for generating the response.
- Distribute the Application to Users:
- Click on āStart the appā to open it in another browser window.
- Share the URL with users for them to access the application.
Additional Tips:
- Kubernetes Deployment:
- The application is container-based and can be run on Kubernetes.
- It can be exposed through a load balancer in cloud environments like AWS EKS, Azure AKS, or Google GKE.
- Note: The application does not include authentication or authorization, which needs to be implemented separately.
- Security Considerations:
- To secure the application, you can use a load balancer, CloudFront, API Gateway, or integrate with identity access management systems.
Video Description Links:
- The video description contains the link to the GitHub repository for the rag app.
- The API key for OpenAI can be obtained from platform.openai.com.
Upcoming Content:
- A subsequent video will demonstrate how to use the application with AMA once integration issues are resolved.
Call to Action:
- The video encourages subscribing to the channel and sharing the content with others.
(Note: The exact URLs and commands are not provided in the transcript and are expected to be found in the video description or by following the instructions in the video.)