Study Tips · Shikhar Burman · 13 March 2026 · 11 min read

How to Build Your Own Personal AI Assistant in 2026: A Step-by-Step Guide for Indian Students and Developers

Building your own personal AI assistant — one that knows your preferences, has access to your notes, can search the web, and runs tasks for you — is now achievable in a weekend with Python, LangChain, and a free LLM API. A complete beginner-to-working-project guide.

The idea of a personal AI assistant — one that knows your context, can access your files, searches the web when needed, and executes tasks on your behalf — has moved from science fiction to weekend project in 2026. The tools available to Indian students and developers now include free and low-cost LLM APIs, mature Python frameworks like LangChain and LlamaIndex, free vector databases for memory, and free cloud hosting for deployment. A functional personal AI assistant is achievable in a weekend by a student with intermediate Python skills.

This guide walks you through building a personal AI assistant from scratch — one that you can customise for your specific needs: a study assistant that knows your syllabus, a coding assistant that knows your projects, or a research assistant that has access to your saved articles and notes. The skills you develop building this project are directly transferable to the AI engineering portfolio work that employers are actively seeking.

What Your Personal AI Assistant Will Do

By the end of this guide, you will have a working AI assistant that: answers questions from your uploaded documents (notes, textbooks, articles) with citations, searches the web in real time when it needs current information, remembers context from previous conversations, and is accessible through a simple chat interface. This is a RAG system with web search augmentation and persistent memory — a combination of three of the most valuable AI engineering skills in 2026.

Architecture: The Components You Will Build

  • LLM backbone — Claude Sonnet 4.6 via Anthropic API or any model available through OpenRouter. Choose based on cost: OpenRouter provides access to multiple models including DeepSeek (cheapest) and Claude (highest quality) through one API key.
  • Document memory (RAG) — LangChain's document loading and chunking pipeline, HuggingFace embeddings (free, runs locally), and ChromaDB (free local vector store). Your uploaded notes and documents become searchable memory.
  • Web search tool — Tavily Search API (free tier available) or DuckDuckGo search wrapper in LangChain. Gives your assistant the ability to find current information it does not have in its context.
  • Conversation memory — LangChain's ConversationBufferWindowMemory to maintain recent context, or ChromaDB for long-term memory across sessions.
  • Interface — Streamlit (free, Python-native) for a simple web chat UI that you can run locally or deploy to Streamlit Cloud for free.

Step 1: Project Setup

Create a Python virtual environment and install: langchain, langchain-anthropic (or langchain-openai), chromadb, sentence-transformers, tavily-python, streamlit, and pypdf. Get a free API key from Anthropic (claude.ai/api) or OpenRouter (openrouter.ai). Get a free Tavily API key from tavily.com — the free tier provides 1,000 searches per month, which is more than enough for personal use.

Step 2: Document Ingestion Pipeline

The document pipeline loads your files, splits them into chunks, generates embeddings (numerical representations that capture semantic meaning), and stores them in ChromaDB. Build this as a separate ingest.py script that you run whenever you add new documents. Use LangChain's DirectoryLoader to load all PDFs and text files from a folder, RecursiveCharacterTextSplitter to chunk them at 1000 characters with 200-character overlap, and HuggingFaceEmbeddings with 'all-MiniLM-L6-v2' to generate free local embeddings.
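The chunking step is simple enough to sketch in plain Python. This is a simplified stand-in for LangChain's RecursiveCharacterTextSplitter (which additionally prefers paragraph and sentence boundaries before falling back to character counts); the 1000/200 numbers match the settings above:

```python
def chunk_text(text: str, chunk_size: int = 1000, overlap: int = 200) -> list[str]:
    """Split text into fixed-size chunks with overlap, so a sentence that
    straddles a chunk boundary still appears whole in at least one chunk."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

chunks = chunk_text("x" * 2500)
# With 2500 characters, step = 800, so this yields 4 chunks; each chunk's
# last 200 characters reappear at the start of the next one.
```

The overlap is what makes retrieval forgiving: a fact split across two chunks is still retrievable from whichever chunk contains it whole.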

Step 3: The Assistant Core — Combining RAG and Web Search

The core of the assistant is a LangChain agent with two tools: a ChromaDB retrieval tool (for answering from your uploaded documents) and a Tavily web search tool (for current information not in your documents). The agent automatically decides which tool to use based on the question — document questions go to ChromaDB, current events questions go to web search. This routing happens automatically through the LLM's reasoning.
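Conceptually, the agent picks a tool by reading each tool's name and description and asking the LLM which one fits the question. The sketch below mimics that contract with a trivial keyword matcher standing in for the LLM's reasoning; in the real system LangChain passes the tool descriptions to the model and parses its choice. All tool names and descriptions here are illustrative:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Tool:
    name: str
    description: str
    run: Callable[[str], str]

# Illustrative stand-ins for the ChromaDB retriever and Tavily search tools.
tools = [
    Tool("document_search", "Answer from the user's uploaded notes and PDFs",
         lambda q: f"[top ChromaDB chunks for: {q}]"),
    Tool("web_search", "Find current, real-time information on the web",
         lambda q: f"[Tavily results for: {q}]"),
]

def route(question: str) -> Tool:
    """Toy router: the real agent sends the tool descriptions to the LLM
    and lets it pick; a keyword check stands in for that reasoning here."""
    wants_current = any(w in question.lower() for w in ("today", "latest", "news"))
    return tools[1] if wants_current else tools[0]

print(route("What are the latest GATE notification dates?").name)  # web_search
print(route("Summarise chapter 3 of my OS notes").name)            # document_search
```

The takeaway is that routing quality depends heavily on how well you write each tool's description, since that text is all the LLM sees when deciding.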

Step 4: Adding Conversation Memory

Without memory, your assistant treats every message as the first: each turn is answered in isolation. LangChain's ConversationBufferWindowMemory maintains the last 10 messages in the context window, giving the assistant short-term conversational memory. For longer-term memory — the assistant remembering that you are preparing for GATE CSE and that your weak area is Theory of Computation — store user preferences as a text file that gets prepended to every system prompt. For advanced memory using vector search (finding relevant past conversations), ChromaDB's semantic search can retrieve related past exchanges.
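The window-plus-preferences idea can be sketched with a deque: keep the last N messages, let the oldest fall off, and prepend the persistent preferences note to every prompt. This is a stdlib sketch of the behaviour, not LangChain's actual ConversationBufferWindowMemory API:

```python
from collections import deque

class WindowMemory:
    def __init__(self, window: int = 10, preferences: str = ""):
        self.messages = deque(maxlen=window)  # oldest messages drop off automatically
        self.preferences = preferences        # long-term facts, e.g. exam and weak areas

    def add(self, role: str, content: str) -> None:
        self.messages.append((role, content))

    def build_prompt(self, system: str) -> str:
        """Prepend the preferences to the system prompt, then replay the window."""
        header = f"{system}\nUser profile: {self.preferences}" if self.preferences else system
        turns = "\n".join(f"{role}: {content}" for role, content in self.messages)
        return f"{header}\n{turns}"

mem = WindowMemory(window=4, preferences="preparing for GATE CSE; weak in Theory of Computation")
for i in range(6):
    mem.add("user", f"message {i}")
# Only the last 4 messages survive in the window; the preferences line
# survives indefinitely because it is re-attached on every prompt build.
```

This split is the key design choice: the window handles "what were we just talking about", while the preferences file handles "who am I talking to".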

Step 5: Streamlit Interface

Streamlit provides a chat interface in less than 30 lines of Python. The key elements: st.chat_message for displaying user and assistant messages, st.chat_input for the message input box, and st.session_state.messages for maintaining conversation history — Streamlit reruns the whole script on every interaction, so history must live in session state or it is lost (note that a full browser refresh still resets the session). Add a file uploader in the sidebar that triggers the ingest pipeline when a new document is uploaded, so your assistant immediately gains access to new documents without restarting.
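Stripped of the UI, the pattern Streamlit wraps is just "append the user message to a history list, generate a reply, append that too". A stdlib sketch, where the history list plays the role of st.session_state.messages and the respond callable stands in for the agent call:

```python
def chat_turn(history: list[dict], user_msg: str, respond) -> str:
    """One chat turn: record the user message, call the model, record the reply.
    In Streamlit, `history` would live in st.session_state.messages so it
    survives the script rerun that every interaction triggers."""
    history.append({"role": "user", "content": user_msg})
    reply = respond(history)
    history.append({"role": "assistant", "content": reply})
    return reply

history: list[dict] = []
echo = lambda h: f"You said: {h[-1]['content']}"  # stand-in for the real agent
chat_turn(history, "Explain pipelining", echo)
chat_turn(history, "Give an example", echo)
# history now holds 4 messages, alternating user and assistant roles.
```

Keeping this logic in a plain function also makes it trivial to unit-test without launching Streamlit at all.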

Customising for Indian Student Use Cases

The baseline assistant becomes genuinely powerful when customised for your specific use case. For a study assistant: add your NCERT chapters and previous year papers to the document folder. Add a system prompt instruction: 'When answering academic questions, always cite the specific document and section your answer comes from.' For a coding assistant: add your project documentation and relevant textbooks. Instruct the assistant to always suggest code in Python and explain the logic, not just provide the solution. For a competitive exam assistant: add your handwritten notes (photographed and OCR-processed) and any PDF study materials.
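Customisation mostly means editing one string. A sketch of assembling the system prompt from a base plus a per-use-case instruction (the instruction texts below are taken from the paragraph above; the function and dictionary names are illustrative):

```python
BASE = "You are a personal AI assistant with access to the user's documents and web search."

USE_CASES = {
    "study": "When answering academic questions, always cite the specific "
             "document and section your answer comes from.",
    "coding": "Always suggest code in Python and explain the logic, "
              "not just provide the solution.",
}

def build_system_prompt(use_case: str) -> str:
    """Append the use-case instruction to the base prompt, if one is defined."""
    extra = USE_CASES.get(use_case)
    return f"{BASE}\n{extra}" if extra else BASE

print(build_system_prompt("study"))
```

Because the prompt is plain data, adding a new use case is a one-line dictionary entry rather than a code change.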

This project — a personal AI assistant with document memory, web search, and a deployed chat interface — is exactly the kind of portfolio project that AI engineering recruiters at Indian product companies and GCCs are looking for in 2026. It demonstrates: RAG implementation, tool use, agent architecture, and deployment. Build it, deploy it on Streamlit Cloud or Render, write a clear README, and it becomes the most compelling item on your resume. LumiChats' Agent Mode can help you build and test the core components without any local setup during the initial development phase.

Pro Tip: The most important thing you can add to this project for differentiation: an evaluation framework. Add a simple test file with 20 question-answer pairs that you know the correct answers to. Run these through your assistant and measure accuracy. Add this evaluation report to your GitHub README with a score and explanation of any failures. Showing that you can not only build AI systems but evaluate and improve them is what separates senior-level thinking from junior-level thinking in AI engineering.
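A minimal version of that evaluation harness, assuming your assistant is exposed as a callable. The exact-match check here is deliberately crude (a real harness would use fuzzy matching or an LLM judge), and all names are illustrative:

```python
def evaluate(assistant, qa_pairs: list[tuple[str, str]]) -> float:
    """Run each question through the assistant and score by containment.
    Returns accuracy in [0, 1]; failures are printed for the README writeup."""
    correct = 0
    for question, expected in qa_pairs:
        answer = assistant(question)
        if expected.lower() in answer.lower():  # crude containment check
            correct += 1
        else:
            print(f"FAIL: {question!r} -> {answer!r} (expected {expected!r})")
    return correct / len(qa_pairs)

# Toy run with a fake assistant; replace `fake` with your real agent call.
qa = [("Capital of India?", "New Delhi"), ("2 + 2?", "4")]
fake = lambda q: "New Delhi is the capital." if "Capital" in q else "The answer is 4."
score = evaluate(fake, qa)
print(f"accuracy: {score:.0%}")  # accuracy: 100%
```

Even 20 hand-written pairs run through this loop gives you a number to report, and the printed failures tell you exactly where to improve chunking, prompts, or tool routing.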

Ready to study smarter?

Try LumiChats for ₹69/day

40+ AI models including Claude, GPT-5.4, and Gemini. NCERT Study Mode with page-locked answers. Pay only on days you use it.

Get Started — ₹69/day
