Together AI, a San Francisco, CA-based AI Acceleration Cloud company, has secured $305 million in Series B funding. The round was led by General Catalyst and co-led by Prosperity7, with participation from a group of investors, including Salesforce Ventures, DAMAC Capital, NVIDIA, Kleiner Perkins, March Capital, Emergence Capital, Lux Capital, SE Ventures, Greycroft, Coatue, Definition, Cadenza Ventures, Long Journey Ventures, Brave Capital, Scott Banister, and John Chambers.
The company says the investment will strengthen its position as a preferred AI cloud for building modern AI applications with open-source models. It will also help the company train custom models with its upcoming large-scale deployment of NVIDIA Blackwell GPUs.
Driving open-source AI adoption
Founded by Vipul Ved Prakash, Ce Zhang, Chris Ré, and Percy Liang in 2022, Together AI is at the forefront of the open-source AI revolution. The company’s AI Acceleration Cloud supports over 450,000 AI developers and enterprises, including major names like Salesforce, Zoom, SK Telecom, and The Washington Post. By integrating state-of-the-art open-source models with high-performance infrastructure, Together AI enables businesses to harness AI’s power without relying on proprietary solutions.
The shift toward open-source models like DeepSeek-R1 and Meta’s Llama highlights the industry’s move away from closed AI ecosystems. Together AI has positioned itself as a leading platform for deploying these models at scale, optimising them for NVIDIA GPUs, and delivering unmatched inference speeds through its advanced infrastructure.
The AI Acceleration Cloud platform
Together AI’s AI Acceleration Cloud is designed to serve the full AI lifecycle. The platform provides:
- Enterprise-grade inference solutions for efficient AI deployment.
- Model training and fine-tuning for advanced AI capabilities.
- Agentic workflows with built-in code interpretation.
- Synthetic data generation to enhance AI model performance.
Supporting over 200 open-source models across text, vision, audio, and code, the cloud platform is powered by proprietary inference engines and in-house research. Key technological advancements, such as FlashAttention-3 kernels and advanced quantisation, enable Together AI to deliver up to 3x faster inference than existing hyperscaler solutions.
Expanding AI infrastructure
To support its growing user base, Together AI is rapidly expanding its infrastructure. The company has secured 200 MW of power capacity and is deploying NVIDIA Blackwell GPU clusters across North America. A partnership with Hypertec will facilitate the deployment of 36,000 NVIDIA GB200 NVL72 GPUs, further strengthening Together AI’s cloud capabilities.
Additionally, the company has launched Together GPU Clusters featuring NVIDIA HGX B200 GPUs and the Together Kernel Collection, which it says deliver a 90% increase in training performance, reducing operational costs while maximising efficiency.
Research and innovation at the core
Together AI’s success is rooted in its commitment to research and innovation. Its research lab continues to pioneer breakthrough methods at the intersection of AI and systems research, with open-source contributions like Mixture of Agents, Medusa, Sequoia, Hyena, and Mamba driving innovation across the industry.
Under the leadership of Chief Scientist Tri Dao, known for creating FlashAttention, the research team continues to push AI performance boundaries while ensuring cost efficiency.
Recent milestones and future vision
In 2024, Together AI reached several major milestones:
- Deployment of DeepSeek models in North American data centers with full privacy controls.
- Launch of the Together Enterprise Platform, now available on AWS Marketplace.
- Partnership with Cartesia to enable ultra-low latency voice AI.
- Acquisition of CodeSandbox, adding built-in code interpretation to the platform.
- Expansion of leadership, with Kai Mak joining as CRO and research expert James Zou enhancing the team’s AI capabilities.
With its latest funding, Together AI is poised to further democratise AI, making powerful open-source AI systems accessible to developers and enterprises worldwide. By combining innovation, efficiency, and transparency, Together AI continues to shape the future of AI infrastructure, ensuring that AI remains cost-effective, high-performance, and available to all.
“AI is transforming every industry, creating unprecedented efficiencies and enabling entirely new classes of products. We have built a cloud company for this AI-first world — combining state-of-the-art open source models and high performance infrastructure, with frontier research in AI efficiency and scalability,” said Together AI CEO Vipul Ved Prakash. “Our AI Acceleration Cloud uniquely provides organizations with the performance, security, and functionality required to train frontier models and build production-scale AI applications with incredible cost efficiency. With this investment, we will accelerate our mission to make open source AI accessible for AI developers and customers globally.”
“Vipul and team have built an incredible tech platform and business, emerging as a dominant player in AI infrastructure in less than two years. I was introduced to them when I invested in their first angel round and have witnessed firsthand the evolution of a product that, today, many Fortune 100 clients use to train, finetune, and run inference on models at scale,” said Marc Bhargava, managing director at General Catalyst. “Together AI’s mission to be the full stack AI cloud is truly inspiring, and General Catalyst brings the go-to-market expertise and ambition to supercharge this goal.”
The post Together AI raises $305M at $3.3B valuation to build next-gen AI cloud on NVIDIA GPUs appeared first on Tech Funding News.