
@runt/ai

AI integration package for Runt runtime agents, providing OpenAI and Ollama clients with streaming responses, tool calling, and agentic conversation support.

Features

  • 🤖 Multiple AI Providers: OpenAI and Ollama support with a unified interface
  • 🔄 Streaming Responses: Real-time token-by-token streaming with markdown rendering
  • 🛠️ Tool Calling: Full support for notebook tools (create_cell, modify_cell, execute_cell)
  • 🤖 Agentic Conversations: Multi-iteration conversations with tool execution
  • 📦 Dynamic Model Discovery: Runtime-based model discovery with capability detection
  • 🌐 Connection Management: Robust connection handling with error recovery
  • ⚡ Performance: Efficient streaming and async processing
  • 🔧 Configuration: Flexible configuration options
  • 🧪 Testing: Comprehensive test suite with 95%+ coverage

Installation

# Install as part of Runt
npm install @runt/ai

// Or import directly with Deno
import { OpenAIClient, RuntOllamaClient } from "jsr:@runt/ai";

Quick Start

OpenAI Client

import { OpenAIClient } from "@runt/ai";

const client = new OpenAIClient({
  apiKey: Deno.env.get("OPENAI_API_KEY"),
});

// Simple conversation
const messages = [
  { role: "user", content: "Explain Python list comprehensions" },
];

// `context` is the ExecutionContext supplied by your runtime agent
await client.generateAgenticResponse(messages, context, {
  model: "gpt-4",
  temperature: 0.7,
  enableTools: false,
});

Ollama Client

import { RuntOllamaClient } from "@runt/ai";

const client = new RuntOllamaClient({
  host: "http://localhost:11434",
});

// Check if ready
const isReady = await client.isReady();
console.log("Client ready:", isReady);

// Simple conversation
await client.generateAgenticResponse(messages, context, {
  model: "llama3.1",
  temperature: 0.7,
  enableTools: false,
});

Dynamic Model Discovery

The package supports runtime model discovery for both providers:

import {
  discoverAvailableAiModels,
  filterModelsByCapabilities,
} from "@runt/ai";

// Discover all available models
const allModels = await discoverAvailableAiModels();
console.log(`Found ${allModels.length} models`);

// Filter by capabilities
const toolCapableModels = filterModelsByCapabilities(allModels, ["tools"]);
const visionModels = filterModelsByCapabilities(allModels, ["vision"]);

// Group by provider
const modelsByProvider = new Map();
for (const model of allModels) {
  if (!modelsByProvider.has(model.provider)) {
    modelsByProvider.set(model.provider, []);
  }
  modelsByProvider.get(model.provider).push(model);
}

Model Capabilities

Models are automatically classified with these capabilities:

  • completion: Basic text completion
  • tools: Function/tool calling support
  • vision: Image understanding
  • thinking: Chain of thought reasoning
  • code: Code generation/understanding
  • multimodal: Multiple input types
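
For example, a minimal sketch that selects the first tool-capable model for an agentic run, falling back to any available model (assuming the capability strings above are the values the package reports):

import {
  discoverAvailableAiModels,
  filterModelsByCapabilities,
} from "@runt/ai";

const models = await discoverAvailableAiModels();
const toolModels = filterModelsByCapabilities(models, ["tools"]);
const chosen = toolModels[0] ?? models[0];

if (!chosen) {
  throw new Error("No AI models available from any provider");
}
// Log the whole entry rather than guessing at field names beyond `provider`.
console.log(`Using a ${chosen.provider} model:`, chosen);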

AI Client Interface

Both clients implement the same core interface:

interface AiClient {
  generateAgenticResponse(
    messages: Message[],
    context: ExecutionContext,
    options: AgenticOptions,
  ): Promise<void>;

  isReady(): Promise<boolean>;
  discoverAiModels(): Promise<AiModel[]>;
}
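
Because both clients satisfy this interface, helpers can be written once against it. A minimal sketch, assuming AiClient, Message, and ExecutionContext are exported types (if not, a structural type with the shape above works the same way):

import type { AiClient, ExecutionContext, Message } from "@runt/ai";

// Try each client in order and use the first one that reports ready.
async function runWithFirstReadyClient(
  clients: AiClient[],
  messages: Message[],
  context: ExecutionContext,
): Promise<void> {
  for (const client of clients) {
    if (await client.isReady()) {
      return client.generateAgenticResponse(messages, context, {
        enableTools: true,
      });
    }
  }
  throw new Error("No AI client is ready");
}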

Agentic Response Options

interface AgenticOptions {
  model?: string;
  temperature?: number;
  enableTools?: boolean;
  maxIterations?: number;
  onToolCall?: (toolCall: ToolCall) => Promise<string>;
  onIteration?: (iteration: number, messages: Message[]) => Promise<boolean>;
  interruptSignal?: AbortSignal;
}
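
A hedged example of bounding an agentic loop with these options; it assumes that returning false from onIteration ends the conversation early, which is one plausible reading of its boolean return:

await client.generateAgenticResponse(messages, context, {
  model: "gpt-4",
  enableTools: true,
  maxIterations: 5, // hard cap on tool-use round trips
  onIteration: async (iteration, history) => {
    console.log(`Iteration ${iteration}: ${history.length} messages so far`);
    // Assumed: returning false stops before maxIterations is reached.
    return history.length < 50;
  },
});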

Tool Calling

Both clients support the same tool calling interface:

await client.generateAgenticResponse(messages, context, {
  model: "gpt-4", // or "llama3.1"
  enableTools: true,
  onToolCall: async (toolCall) => {
    console.log(`Tool called: ${toolCall.name}`);
    console.log("Arguments:", toolCall.arguments);

    // Execute the tool
    const result = await executeNotebookTool(toolCall);
    return result;
  },
});

Available Tools

  • create_cell: Create new notebook cells
  • modify_cell: Edit existing cell content
  • execute_cell: Run code cells
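
A sketch of an onToolCall dispatcher over these three tools. The notebook helpers are hypothetical stand-ins for your own cell operations, not part of this package:

import type { ToolCall } from "@runt/ai";

// Hypothetical notebook helpers; replace with your own implementations.
declare const notebook: {
  createCell(args: unknown): Promise<string>;
  modifyCell(args: unknown): Promise<string>;
  executeCell(args: unknown): Promise<string>;
};

const dispatch = async (toolCall: ToolCall): Promise<string> => {
  switch (toolCall.name) {
    case "create_cell":
      return notebook.createCell(toolCall.arguments);
    case "modify_cell":
      return notebook.modifyCell(toolCall.arguments);
    case "execute_cell":
      return notebook.executeCell(toolCall.arguments);
    default:
      return `Unknown tool: ${toolCall.name}`;
  }
};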

Tool Registry

The package includes a built-in tool registry:

import { handleToolCallWithResult } from "@runt/ai";

const result = await handleToolCallWithResult(
  store,
  logger,
  sessionId,
  currentCell,
  toolCall,
);

Configuration

OpenAI Configuration

const openaiClient = new OpenAIClient({
  apiKey: Deno.env.get("OPENAI_API_KEY"),
  baseURL: "https://api.openai.com/v1", // Custom base URL
  organization: "org-123", // Organization ID
  project: "proj-456", // Project ID
});

Ollama Configuration

const ollamaClient = new RuntOllamaClient({
  host: "http://localhost:11434", // Default Ollama host
  model: "llama3.1", // Default model
  headers: {
    "Custom-Header": "value",
  },
});

Environment Variables

# OpenAI
export OPENAI_API_KEY=sk-...
export OPENAI_BASE_URL=https://api.openai.com/v1

# Ollama
export OLLAMA_HOST=http://localhost:11434
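
A minimal sketch wiring these variables into the clients under Deno (the fallback values mirror the defaults above):

const openai = new OpenAIClient({
  apiKey: Deno.env.get("OPENAI_API_KEY"),
  baseURL: Deno.env.get("OPENAI_BASE_URL") ?? "https://api.openai.com/v1",
});

const ollama = new RuntOllamaClient({
  host: Deno.env.get("OLLAMA_HOST") ?? "http://localhost:11434",
});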

Streaming Support

Both clients support real-time streaming:

// Streaming is enabled by default
await client.generateAgenticResponse(messages, context, {
  model: "gpt-4",
  // Content appears progressively in the execution context
});

Error Handling

Comprehensive error handling with contextual messages:

try {
  await client.generateAgenticResponse(messages, context, {
    model: "nonexistent-model",
  });
} catch (error) {
  // Error is automatically displayed in context
  console.log("Conversation failed:", error);
}

Interruption Support

Conversations can be interrupted gracefully:

const abortController = new AbortController();

// Start conversation
const promise = client.generateAgenticResponse(messages, context, {
  interruptSignal: abortController.signal,
});

// Interrupt after 5 seconds
setTimeout(() => abortController.abort(), 5000);

await promise; // Will stop cleanly

Ollama Setup

For local AI with Ollama:

# Install Ollama
curl -fsSL https://ollama.ai/install.sh | sh

# Start Ollama server
ollama serve

# Pull some models
ollama pull llama3.1
ollama pull mistral
ollama pull codellama

Model Management (Ollama)

// Check if model exists
const exists = await client.ensureModelExists("llama3.1");

// Get available models
const models = await client.getAvailableModels();
models.forEach((model) => {
  console.log(`${model.name} - ${model.details.parameter_size}`);
});
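
Combining the two, a hedged sketch that verifies a model is present before starting a conversation (assuming ensureModelExists resolves to a boolean, as the snippet above suggests):

if (await client.ensureModelExists("llama3.1")) {
  await client.generateAgenticResponse(messages, context, {
    model: "llama3.1",
    enableTools: true,
  });
} else {
  console.error('Model missing; try "ollama pull llama3.1" first');
}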

Integration with Runt

In Runtime Agents

import { OpenAIClient, RuntOllamaClient } from "@runt/ai";

// In your runtime agent
const aiClient = new OpenAIClient(); // or RuntOllamaClient

export async function executeAI(context: ExecutionContext) {
  const messages = buildConversationMessages(context);

  await aiClient.generateAgenticResponse(messages, context, {
    model: "gpt-4",
    enableTools: true,
    onToolCall: handleToolCall,
  });
}

Runtime Capabilities

Include AI model discovery in your runtime capabilities:

import { discoverAvailableAiModels } from "@runt/ai";

const capabilities = {
  runtimeType: "pyodide",
  availableAiModels: await discoverAvailableAiModels(),
  // ... other capabilities
};

Provider Comparison

| Feature          | OpenAI Client | Ollama Client |
| ---------------- | ------------- | ------------- |
| Streaming        | ✅            | ✅            |
| Tool Calling     | ✅            | ✅            |
| Model Management | ❌            | ✅            |
| Local Models     | ❌            | ✅            |
| Cost             | 💰            | 🆓            |
| Privacy          | ☁️            | 🏠            |
| Speed            | Fast          | Variable      |
| Setup            | API key       | Local install |
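
These trade-offs suggest a common pattern: prefer a local Ollama instance when it is running, and fall back to OpenAI otherwise. A sketch using only the documented isReady() check:

const ollama = new RuntOllamaClient({ host: "http://localhost:11434" });
const openai = new OpenAIClient({ apiKey: Deno.env.get("OPENAI_API_KEY") });

// Free and private when available; paid cloud API as the fallback.
const client = (await ollama.isReady()) ? ollama : openai;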

API Reference

Functions

  • discoverAvailableAiModels(): Promise<AiModel[]> - Discover all available AI models
  • filterModelsByCapabilities(models: AiModel[], capabilities: string[]): AiModel[] - Filter models by capabilities
  • handleToolCallWithResult(store, logger, sessionId, currentCell, toolCall): Promise<string> - Execute tool calls

Classes

  • OpenAIClient - OpenAI API client with streaming and tool support
  • RuntOllamaClient - Ollama client with local model management

Types

  • AiModel - Model information with capabilities and metadata
  • ModelCapability - Capability types (completion, tools, vision, etc.)
  • ToolCall - Tool call structure with name and arguments
  • AgenticOptions - Configuration options for agentic responses

Troubleshooting

OpenAI Issues

  1. "API key not found"

    export OPENAI_API_KEY=sk-your-key-here
    
  2. Rate limiting

    • Reduce temperature for faster responses
    • Implement exponential backoff (see the sketch below)
    • Use different models for different tasks
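
A generic retry wrapper with exponential backoff; the policy itself (delays, retry count) is an assumption, not part of the package:

async function withBackoff<T>(
  fn: () => Promise<T>,
  maxRetries = 5,
): Promise<T> {
  let delayMs = 1000;
  for (let attempt = 0; ; attempt++) {
    try {
      return await fn();
    } catch (error) {
      if (attempt >= maxRetries) throw error;
      // Wait, then double the delay: 1s, 2s, 4s, ...
      await new Promise((resolve) => setTimeout(resolve, delayMs));
      delayMs *= 2;
    }
  }
}

await withBackoff(() =>
  client.generateAgenticResponse(messages, context, { model: "gpt-4" })
);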

Ollama Issues

  1. "Ollama server not available"

    ollama serve
    curl http://localhost:11434/api/tags
    
  2. "Model not found"

    ollama pull llama3.1
    ollama list
    
  3. Connection refused

    export OLLAMA_HOST=http://localhost:11434
    

Testing

# Run all AI tests
deno task test packages/ai/

# Run specific test files
deno task test packages/ai/test/ollama-client.test.ts
deno task test packages/ai/test/streaming-markdown.test.ts

# Run with coverage
deno task test:coverage

Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Add tests for new functionality
  4. Run the test suite: deno task ci
  5. Submit a pull request

License

MIT License - see the main Runt project for details.
