About
Hi there! π
Iβm Jet Xu, an engineer passionate about building AI systems that understand code at scale. My work focuses on Repository Graph RAG architectures, LLM context engineering, and evidence-based code intelligence β making AI agents truly comprehend software repositories through structured knowledge graphs.
I believe the future of automated development lies in Code Graph RAG β where AI agents reason about codebases through persistent knowledge graphs capturing classes, methods, dependencies, and call relationships. Thatβs the core of what Iβm building.
π― Core Research & Engineering Focus
Repository Graph RAG Architectures
Building next-generation retrieval systems specifically for code understanding:
- Code Knowledge Graphs capturing structural relationships (classes, methods, imports, call graphs)
- Hybrid retrieval pipelines combining graph traversal, semantic search, and LLM-powered reasoning
- Multi-source context assembly from self repositories, public codebases, issues, and discussions
- Production-grade graph databases (ArangoDB) for efficient traversal and complex query patterns
LLM Context Engineering for Automated Development
Designing intelligent context retrieval that powers AI-driven development workflows:
- Evidence-based reasoning grounding AI insights in actual code snippets and architectural context
- Multi-stage retrieval with embedding models, rerankers, and confidence scoring
- Task-aware context selection adapting retrieval strategies based on development task complexity
- Repository pool caching for real-time performance in multi-agent systems
Privacy-Preserving Code Intelligence
Building secure architectures for enterprise code analysis:
- Zero-knowledge storage using cryptographic blind indexing (HMAC-SHA256)
- Ephemeral processing where code content never persists, only graph structures
- Federated learning approaches for cross-repository insights without data sharing
π Featured Projects
LlamaPReview β Evidence-Based AI Code Review
A GitHub App delivering Graph RAG-powered PR reviews with context retrieval and severity gating for low-noise, high-impact insights.
Core Architecture:
- π§ Context Retrieval Engine: Finds related, unchanged code to surface ripple effects early
- π Evidence-Backed Findings: Every high-priority issue ties to real code snippets with confidence scores
- π― Severity Gating: Deterministic filtering reduces false positives and review fatigue
- π Secure Knowledge Graph: Professional tier uses ArangoDB for persistent code structure analysis
Technical Innovations:
- Repository Graph RAG: Classes, methods, dependencies, and call graphs enable deep semantic understanding
- Zero-Knowledge Architecture: Code content is ephemeral; only cryptographic fingerprints persist
- Self-Adaptive Review: Learns from your teamβs review patterns over time
- Multimodal Analysis: Inline suggestions, architectural diagrams (Mermaid), and structured summaries
Current Adoption:
- π 3,000+ active repositories (32,000+ combined stars)
- π Free for public repos with automatic advanced analysis on complex PRs
- π Professional tier (launching soon) with full knowledge graph capabilities
llama-github β Foundation for Automated AI-Driven Development
An open-source Python library for efficient GitHub knowledge retrieval, designed as the context retrieval module for future automated development systems powered by multi-agent LLMs.
Vision: Automated AI-Driven Development
The library serves as a foundational component in a larger architecture (see diagram above) that enables:
- Multi-Agent Orchestration: LLM-based agents collaborating on complex coding tasks
- Retrieval-Augmented Development Loop: Continuous context retrieval from self repos and public knowledge
- Task & State Management: Intelligent task allocation, scheduling, and state tracking
- Development Tools Integration: File editing, testing, version control, and deployment automation
Current Capabilities:
- π Dual-Mode Retrieval: Self repositories + public GitHub resources (code, issues, discussions)
- π― LLM-Powered Query Analysis: Intelligent search strategy generation
- π Advanced Ranking: Embedding similarity + reranker scores + LLM relevance scoring
- β‘ Production-Ready: Async processing, repository pool caching, flexible authentication
- ποΈ Vector Database Integration: Efficient storage and retrieval at scale
Quick Start:
1
pip install llama-github
1
2
3
4
5
6
7
8
9
10
from llama_github import GithubRAG
github_rag = GithubRAG(
github_access_token="your_token",
openai_api_key="your_key"
)
# Retrieve context for development tasks
query = "How to implement async context managers in Python?"
context = github_rag.retrieve_context(query)
Future Development: While currently focused on LlamaPReview development, llama-github will evolve to support the full automated development vision β enabling AI agents to autonomously accomplish complex coding tasks through intelligent context retrieval and tool orchestration.
License: Apache 2.0 | Status: Available on PyPI |
π οΈ Technical Expertise
AI/ML & Code Intelligence
- Repository Graph RAG: ArangoDB for code knowledge graphs, graph traversal algorithms
- LLM APIs & Models: OpenAI GPT-4o, Anthropic Claude, Google Gemini, Mistral AI
- Embedding & Reranking: Jina AI embeddings and rerankers, sentence-transformers
- Code Analysis: AST parsing, static analysis, tree-sitter, semantic understanding
- Vector Databases: Efficient storage and retrieval for embedding-based search
Enterprise Architecture & Cloud-Native Systems
- Cloud-Native Architecture: Microservices, containerization, auto-scaling, high availability
- Cloud Platforms: Alibaba Cloud, AWS (compute, storage, serverless)
- High-Concurrency Systems: Distributed architectures, message queues, caching strategies
- Domain-Driven Design: Modular architecture, service boundaries, API design
Search & Data Technologies
- Semantic Search: Multimodal search (text, image, voice), embedding-based retrieval
- Graph Databases: ArangoDB for complex relationship modeling and traversal
- Databases: MySQL, MongoDB, Redis, distributed data systems
- Data Platforms: Customer data platforms, real-time processing, analytics
π What I Write About
On this blog, youβll find deep dives into:
- Repository Graph RAG: Architectures for building code knowledge graphs that power AI agents
- LLM Context Engineering: Strategies for intelligent context retrieval in automated development
- Code Intelligence Systems: Building AI that reasons about software architecture, not just syntax
- Privacy-Preserving AI: Cryptographic techniques for secure code analysis without persistent storage
- Production ML Systems: Lessons learned from deploying AI tools at scale (3,000+ repos)
- Multi-Agent Orchestration: Designing systems where LLM-based agents collaborate on complex tasks
I also share insights from enterprise architecture, digital transformation, and the intersection of AI with real-world software engineering challenges.
π Current Initiatives
Short-Term (Q4 2025)
- π Launching LlamaPReview Professional with ArangoDB-powered knowledge graphs for private repos
- π Publishing benchmarks: Code Graph RAG vs. vector-only RAG for software understanding tasks
- π Writing comprehensive guides on Repository Graph RAG architectures
Long-Term Vision: Automated AI-Driven Development
Building towards fully automated development systems (as illustrated in the architecture diagram) that combine:
- Multi-Agent Orchestration: LLM-based agents collaborating through intelligent task allocation
- Repository Graph RAG: Persistent code knowledge graphs enabling deep architectural reasoning
- Development Tools Integration: Seamless file editing, testing, version control, and deployment
- Continuous Learning: Self-adaptive systems that improve from developer feedback
- Privacy-First Design: Zero-knowledge architectures for enterprise adoption
The vision: AI agents that can autonomously accomplish complex coding tasks by intelligently retrieving context from self repositories and public knowledge, while maintaining human oversight through conversation management.
π‘ Philosophy
βThe best AI tools are evidence-based β they ground insights in real code structures, not hallucinations.β
I believe in building AI systems that:
- β Reason through graphs β Code understanding requires structural knowledge, not just embeddings
- β Ground in evidence β Every insight ties back to actual code snippets and relationships
- β Prioritize signal over noise β Severity gating and confidence scoring reduce false positives
- β Respect privacy β Cryptographic blind indexing, zero-knowledge storage, ephemeral processing
- β Learn continuously β Adapt to team patterns, improve through usage
π Background
With 15+ years spanning enterprise architecture and AI/ML engineering, I bring a unique perspective to building practical AI systems:
- AI/ML Engineering: Hands-on development of LLM-powered applications, Graph RAG systems, and AI agents
- Enterprise Architecture: Designing mission-critical platforms (CRM, CDP, e-commerce) serving millions of users
- Technical Leadership: Leading cross-functional teams (architecture, DevOps, QA) in multinational environments
- Open Source Contribution: Building developer tools (llama-github, LlamaPReview) used by the global community
My experience includes architecting cloud-native platforms with auto-scaling and high availability, implementing semantic search engines with multimodal capabilities (image, voice), and designing privacy-compliant data systems for regulated industries.
Iβve also led digital transformation initiatives from 0 to 1, including:
- China Unified Services Platform at Kering (luxury retailβs strategic business mid-platform)
- Digital Mid-Platform at Volvo Cars (20+ microservices, 10+ system integrations)
- Social Network Mobile App as founder/CEO (millions of users, ML-powered personalization)
This combination allows me to bridge cutting-edge AI research and production-ready systems that developers actually want to use.
π Recognition & Achievements
- π₯ First Award at National Olympiad in Informatics in Provinces (NOIP)
- β Top Contributor (Top 5%) at IBM for three consecutive years (2012-2014)
- π 3,000+ repositories actively using LlamaPReview
- π¦ Open-source maintainer of llama-github (Apache 2.0 license)
π« Get in Touch
Iβm always interested in discussing:
- Repository Graph RAG architectures for code intelligence
- Challenges in building production LLM systems with privacy guarantees
- Multi-agent orchestration for automated development
- Collaboration opportunities on open-source AI tooling
- Feedback on LlamaPReview and llama-github
Connect with me:
- πΌ LinkedIn: Jiantong Xu
- π GitHub: @jetxu-llm
- π§ Email: jiantong.xu@foxmail.com
- π Projects:
π Acknowledgments
My work builds on the shoulders of giants. Special thanks to:
- ArangoDB for the powerful multi-model database enabling complex graph queries
- Jina AI for excellent embedding models and reranker APIs
- LangChain for the foundational LLM application framework
- The open-source community for countless libraries, tools, and shared knowledge
π Fun Facts
- π¦ The βLlamaβ in my project names honors Metaβs Llama models and the open-source LLM movement
- π Double bachelorβs degree in Biological Sciences & Computer Sciences from Fudan University
- π― My goal: Build AI agents that can autonomously accomplish complex coding tasks through intelligent context retrieval
- π Firm believer that code understanding requires graphs, not just embeddings
Last Updated: October 2025
This page is a living document. Check back for updates on Repository Graph RAG research and automated development progress!