* This blog post is a summary of this video.

Latest AI Advancements: Stable Code, GPT Bot, Audio Generation, and More

Table of Contents

Stable Code - New 3 Billion Parameter Model for Code Generation

Stability AI has released Stable Code, a new set of models for code generation and completion. Stable Code includes a 3 billion parameter model that rivals previous state-of-the-art models like Stockholder in capability, while being smaller and able to run on consumer GPUs.

Stable Code offers code completion, code generation, and the ability to chat about code. It can be tested out on Hugging Face. The prompt structure allows you to provide an instruction and code context, then generate a relevant response.

With performance rivaling the much larger PaLM and Stockholder models, Stable Code brings powerful AI-generated code within reach of individual developers.

Code Completion and Instruction Capabilities

Stable Code includes Stable Code Completion for code autocompletion and Stable Code Instruct for generating code from natural language instructions. It performs remarkably well considering its smaller size compared to previous models. The code completion features allow developers to generate relevant function names, arguments, and even full code blocks by providing some initial code context. The instruction model can produce code based on conversational prompts.

Consumer Hardware Compatibility

At just 3 billion parameters, Stable Code is small enough to run on consumer GPUs rather than requiring expensive specialized hardware. This makes AI-generated code more accessible to individual developers and small teams.

OpenAI's GPT Bot Identifies Itself for Web Crawling

OpenAI has revealed that its web crawling bot now identifies itself with a custom user agent string. Previously, the bot likely used random agents.

The new "GPT" user agent allows websites to identify OpenAI bots and block or restrict them if desired. This is a transparent, ethical move by OpenAI.

Anthropic's Cloud Instant 2.1 - Faster, Cheaper AI Model

Anthropic has released Cloud Instant 2.1, an improved version of its conversational AI model. The new model is faster, lower cost, and still highly capable.

Compared to version 1.1, Instant 2.1 has significantly improved response times and lower price per token. The model can be tested and compared to others like GPT-3.5 on services like Anthropic's Claude Playground.

New Audio Generation Framework - Audio LDM2

A new neural audio generation model called Audio LDM2 has been open sourced. It can generate 10 second audio clips from text prompts in about 1 minute.

Audio LDM2 code is available on GitHub. Examples like generating bird sounds, drums, and more can be tested on Hugging Face. This model could be useful for sound effect and audio generation in creative projects.

Input Streaming Allows Real-Time Text to Speech

Lever Labs has developed streaming capabilities that enable real-time text-to-speech using large language models. Previously, generating speech from LLM text required waiting for the full output.

By streaming text responses directly to Lever Labs' API, speech can be generated with minimal lag. This enables more natural conversational experiences with text-based agents.

Hyperwrite Browser Extension for AI Assistance

Startup Hyperwrite has launched an AI assistant browser extension powered by large language models. It aims to bring AI directly into browsers to enhance search, writing, and more.

After installing, the extension provides an interface to ask questions or give instructions. It then performs automated research and task completion using underlying AI systems. This demonstrates how LLM-based agents could become integrated into familiar software.

Magnetic for Coding by Natural Language

The open source Magnetic project allows developers to describe functions in natural language, which are then implemented by an AI system.

For example, a prompt like "Add more dudeness to the phrase" results in a function that turns text like "Hello there!" into "Hey dude what's up!". Data types of inputs and outputs can also be specified.

Human Script - Controlling Computers with Natural Language

Human Script is a new project that allows writing scripts for tasks like file operations purely in natural language instructions. The instructions are interpreted into code and executed.

For instance, a script to recursively process files in a directory can simply be described in plain English, without needing to write loops and logic manually. This demonstrates how close we are to controlling computers through conversational language.

API to AI Creates Assistants from API Specs

API to AI is a service that generates AI conversational agents from API specifications. It uses OpenAI to build assistants that can discuss and interact with any documented API.

The AI agents can take conceptual instructions like "book a train from Paris to Berlin" and produce valid API requests and code for the required workflow. This simplifies learning and using complex APIs.

Agent Bandage Evaluates AI Agents Over Diverse Tasks

As more AI agents emerge, evaluating their general capabilities is a challenge. Agent Bandage provides a framework for testing agents across 8 environments like operating systems, the web, and databases.

By assessing performance on many different types of tasks, Agent Bandage aims to produce a more rigorous and well-rounded analysis of strengths and weaknesses for current AI agents.

Chidori Enables Building Durable AI Agents

Chidori is a new open source runtime for creating AI agents using a reactive architecture. It is built in Rust and focuses on robust monitoring and modular composition of agent behavior.

Key goals of Chidori include better visibility into the internals of AI agents as they operate, as well as simplifying the agent development process. The reactive paradigm provides a promising path towards more transparent and trustworthy AI systems.

FAQ

Q: How can stable code be used for coding assistance?
A: Stable code offers code completion features that can help with writing code more efficiently. It also provides instructions related to working with code.

Q: What hardware is needed to run stable code?
A: Stable code is designed to run on standard consumer GPUs, making AI-assisted coding more accessible.

Q: How does GPT bot identify itself?
A: GPT bot reveals itself through a crawler user agent string, allowing websites to control its access if desired.

Q: What improvements does Cloud Instant 2.1 offer?
A: Cloud Instant 2.1 is faster, cheaper, and still highly capable compared to previous versions of Anthropic's model.

Q: What can Audio LDM2 generate?
A: Audio LDM2 is a new framework capable of creating realistic audio clips such as bird sounds and music.

Q: How does input streaming enable real-time TTS?
A: Streaming the text from LLMs allows it to be piped directly into speech generation, avoiding delays.

Q: What does the Hyperwrite browser extension provide?
A: Hyperwrite adds AI assistance features directly into any browser, such as research capabilities.

Q: How does Magnetic allow coding by language?
A: Magnetic leverages LLMs to generate functions purely from natural language descriptions of what they should do.

Q: What can Human Script accomplish?
A: Human Script allows controlling computer functions by writing scripts in plain language rather than code.

Q: How does API to AI create assistants?
A: It takes API specifications and builds conversational agents with ability to understand and leverage those APIs.

Q: Why is Agent Bandage valuable?
A: Agent Bandage provides a standardized framework for evaluating AI agents across diverse tasks.