Efficient AI inference for GenAI applications
OctoAI provides optimized inference endpoints for popular open-source models with automatic scaling.
Use OctoAI through Keystore with zero code changes. Keys are resolved from the vault and injected at request time.
import Keystore from "@keystore/sdk";
const ks = new Keystore({ agentToken: process.env.KS_TOKEN! });
ks.interceptAll();
// All requests to OctoAI's API are automatically
// intercepted and routed through the Keystore proxy.
// Real credentials are injected server-side.
const res = await fetch("https://api.octo-ai.com/v1/...", {
method: "POST",
headers: { "Content-Type": "application/json" },
body: JSON.stringify({ /* your payload */ }),
});
const data = await res.json();
console.log(data);Request access and our concierge team will provision credentials for you — usually within 24 hours. No setup on your end.
Request Access