Fireworks.ai is a California-based artificial intelligence (AI) startup that’s providing a singular answer for enterprises. The AI agency doesn’t construct massive language fashions (LLMs) or basis fashions from scratch however fine-tunes open-source fashions and converts them into an Application Programming Interface (API) to assist companies deploy the AI capabilities in a seamless trend. The fine-tuning reduces the scope of the AI mannequin and focuses it on a selected performance. This permits them to cut back cases of AI hallucinations and enhance the capabilities of the mannequin considerably.
The AI agency was co-founded by Lin Qiao who additionally holds the seat of the CEO in the firm. After serving as the Senior Director of Engineering at Meta and dealing with AI frameworks and platforms, Qiao and her crew based the startup in October 2022, as per her LinkedIn profile. In a dialog with TechCrunch, she defined the enterprise mannequin of Fireworks.ai, highlighting the fine-tuning service they supply. She stated, “It can be either off the shelf, open source models or the models we tune or the models our customer can tune by themselves. All three varieties can be served through our inference engine API.”
This places the agency in a singular place the place whereas it’s not innovating at the basis mannequin stage, it’s bridging the hole between an LLM and a business-ready product that may be deployed seamlessly. With a main give attention to constructing APIs, Fireworks.ai lets its enterprise purchasers plug and play any open-source AI mannequin in its catalogue. As per the report, the firm additionally lets companies experiment with totally different AI fashions to decide on the one that matches their wants.
At current, the startup claims to include 89 open-source LLMs resembling Mixtral MoE 8x7B Instruct, Meta’s Llama 2 70B Chat, Google’s Gemma 7B Instruct, Stability AI’s Stable Diffusion XL, and extra. The AI agency gives the fashions in both serverless format that doesn’t require companies to configure {hardware} or deploy fashions, or as on-demand fashions which can be found for devoted deployments, served on reserved GPU configurations in keeping with enterprise wants.
For the on-demand format, Fireworks.ai has three cost plans — Developer, Business, and Enterprise — the place the Developer plan comes with a pay-per-usage construction and a fee restrict of 600 requests per minute, the Enterprise tier has customized pricing gives and limitless fee limits. The serverless format is billed at a per-token pricing plan the place totally different fashions, relying on whether or not they’re text-only, image-only, or multimodal, will fetch a unique worth.