AI / LLM System
Built an AI agent using a locally hosted LLM to interact with the internal helpdesk system.
An AI agent that can answer questions about helpdesk tickets, look up asset information, and perform basic actions in the helpdesk system — all running on local hardware with no external API calls.
The system has three main layers:
Frontend — A Next.js chat interface where users type queries in natural language.
Agent — A LangGraph-based agent orchestrating multi-step reasoning. It decides which tools to call, handles tool output, and composes a final answer.
Retrieval — A hybrid retrieval system combining BM25 keyword search and vector similarity search. BM25 handles exact term matches well (ticket IDs, names); vector search handles semantic similarity. Results are fused before being passed to the model.
Privacy requirements made external APIs a non-starter for this use case. Running everything locally also eliminated per-token costs and allowed fine-tuning on internal data without sending it to a third party.