Project Vajra

Next-Gen AI Inference

An open source inference engine built for
next-generation AI systems.

Coming in winter 2025

Built for the Future

Every design decision optimized for modern AI workloads

Performance First

Native C++ core, novel parallelism strategies, and intelligent caching. No compromises on speed or efficiency.

Designed for Scale

From single GPUs to massive clusters. Native support for multimodal processing and extreme context lengths.

Open by Design

Built by the community, for the community.

Build With Us

We're looking for system builders, performance engineers, and AI infrastructure developers to join our open source initiative and help build the next generation of AI systems.