II-Agent - Open-Source Autonomous AI Assistant Platform

Overview

II-Agent V1 (launched February 4, 2026) is a production-ready, generalist open-source autonomous AI assistant designed for complex real-world workflows across research, coding, content creation, and data analysis. Developed by Intelligent Internet (ii.inc), it represents a full autonomous agent platform comparable to Manus, Genspark, and Kimi Claw.

Core Identity

  • Fully Autonomous: Interprets business objectives and executes multi-step tasks without step-by-step guidance
  • Generalist Design: Handles research, coding, design, content creation, and data analysis
  • Open-Source: Apache 2.0 license, fully auditable, deployable locally or via cloud
  • Multi-Model: Switch between Claude, Gemini, GPT-5 in single conversation thread
  • User-Sovereign: BYOK (Bring Your Own Keys), data stays under user control
  • Production-Ready: V1 release after extensive beta testing with thousands of builders

Performance Metrics

Benchmarks (V1):

  • Terminal Bench: 61.8% (top performer for open-source)
  • SWE Bench Pro: 45.1% (software engineering tasks)
  • GAIA Benchmark: 75.57% (general reasoning)
  • Terminal Bench 2: Top ranking

Core Capabilities & Workflows

1. Full-Stack Web Development

End-to-End Application Building:

  • Database schema to frontend
  • Automated code generation
  • Testing and optimization
  • Deployment integration (Vercel, NeonDB)

Runtime Options:

  • Local Mode (lightweight)
  • Docker (containerized)
  • E2B (managed cloud execution)

Features:

  • Live editing mode (adjust without regeneration)
  • Component library exports
  • Serverless deployment ready

2. Research & Fact-Checking

Dual Research Modes:

  • Fast Research: Quick answers (seconds)
  • Deep Research: Multi-step investigations with source triangulation

Capabilities:

  • Real-time literature review
  • Hypothesis generation
  • Structured note-taking
  • Rapid summarization
  • Source triangulation and verification
  • Integration with II-Researcher framework

3. Content Creation

Formats Supported:

  • Blog and article drafts
  • Lesson plans and educational content
  • Creative prose and storytelling
  • Technical manuals and documentation
  • Website creation and deployment
  • Slide presentations (automated design)
  • Storybooks with illustrations

4. Media Generation

Image Generation:

  • Nano Banana Pro
  • GPT Image 1.5
  • Imagen 4.0
  • High-fidelity asset creation

Video Generation:

  • Veo 3.1 integration
  • Frame control (start/end)
  • Resolution and aspect ratio options
  • Audio integration
  • Automated editing

5. Data Analysis & Visualization

Data Processing:

  • Data cleaning and transformation
  • Statistical analysis
  • Trend detection
  • Automated report generation
  • Charting and visualization
  • REPL environment for code execution

6. Code Execution & Development

Execution Environment:

  • Full terminal/shell access
  • Python and Node.js execution
  • Code interpretation (GPT-5 Code Interpreter)
  • Document manipulation (PDF, Excel, Word, PowerPoint)
  • Form filling and automation

7. Browser Automation

Playwright-Powered:

  • Form filling and submission
  • Screenshot capture
  • Element interaction
  • Web scraping and data extraction
  • End-to-end testing

8. Audio & Speech

Speech Features:

  • Audio transcription
  • Speech-to-text for prompts
  • Audio integration in videos

Platform Access & Interface

Web Platform: agent.ii.inc

II-Agent Chat:

  • Unified web interface
  • Multi-model selection in single thread
  • Pre-configured models: Gemini 3, Sonnet 4.5, GPT-5
  • Mobile-responsive design
  • Team collaboration support

No Setup Required:

  • Three pre-configured models (ready immediately)
  • Connect additional models via API keys
  • BYOK support for cost control

Open-Source Deployment

  • Full source code available (Apache 2.0)
  • Self-host on own infrastructure
  • Local model options
  • Customizable behavior

Key Features

Multi-Model Support

  • Work Across Models in single conversation with full context retention
  • Switch between Claude, Gemini, GPT-5 seamlessly
  • BYOK (Bring Your Own Keys) for model selection
  • Pre-configured models ready to use

Plan Mode

  • Visualize project plans before execution
  • Review and modify requirements
  • Adjust scope before build
  • Reduce wasted execution

Live Editing

  • Design-first interface for real-time refinement
  • Font changes, borders, layout adjustments
  • Website live preview
  • No need to regenerate from scratch

Universal Connectors

Integrated Platforms:

  • GitHub (repos, PRs, issues, commits)
  • Slack (team communication)
  • Gmail (email automation)
  • Google Workspace (docs, sheets, slides)
  • Notion (knowledge base)
  • Discord (community channels)
  • Dropbox (file storage)
  • Canva (design collaboration)

Custom Skills & Extensibility

  • Skills: Connect GitHub repos as reusable workflows
  • Model Context Protocol: Plug in custom workflows
  • One-click Actions: Execute specific processes
  • Developer-Focused: Transform conversations into automation

Bring Your Own Key (BYOK)

  • Full control over AI models
  • Pay only for chosen services
  • No vendor lock-in
  • Privacy - keys remain on your device

Technical Architecture

Agent Hierarchy

Partner Agent (runs on user’s device):

  • Interprets natural language goals
  • Maintains user communication
  • Drafts and oversees plans

Principal Agents (orchestration):

  • Execute specific project goals
  • Coordinate task execution
  • Report results

Associate Agents (task workers):

  • Execute discrete tasks
  • Web search, code generation, data processing
  • Short-lived and focused

All Agents Remain Open-Source

  • Auditable and reproducible
  • Can call proprietary endpoints when needed
  • Users can fork and customize
  • No hidden functionality

Pricing & Cost

Web Platform (agent.ii.inc)

BYOK Model: Users select and pay for preferred models

  • Claude/GPT/Gemini: Market rates (~$3-15 per 1M tokens)
  • Open-source models: Free
  • No platform subscription fees

Self-Hosted

Free Open-Source:

  • Apache 2.0 license
  • Deploy on own infrastructure
  • Local models: Zero cost
  • Full code transparency

Comparison with Similar Platforms

vs Manus

AspectII-AgentManus
Open-Source✅ Yes (Apache 2.0)❌ No
Autonomy✅ High✅ High
Multi-Model✅ Easy switching⚠️ Limited
User Sovereign✅ BYOK + cryptographic keys⚠️ Monica servers
Cost✅ BYOK (no platform fee)~$9/month
Research Focus✅ Strong (II-Researcher)✅ Very strong
Code Execution✅ Full terminal✅ Full terminal
Customization✅ Source code available❌ Proprietary

vs Genspark

AspectII-AgentGenspark
Autonomy✅ High (multi-step reasoning)⚠️ Moderate (more conversational)
Code Execution✅ Full terminal✅ Full (but via APIs)
Agent Swarm❌ Hierarchical only✅ Up to 100 parallel agents
Open-Source✅ Yes❌ No
Multimodal✅ Images, video, audio✅ Images, video, voice calls
Model Freedom✅ BYOK supported⚠️ OpenAI-dependent
Phone Calls❌ No✅ Yes (unique feature)
Cost✅ BYOK (free if local)~$0-20/month
Customization✅ Full (open-source)❌ Limited

vs Kimi Claw (Hosted OpenClaw + Kimi K2.5)

AspectII-AgentKimi Claw
Managed Service⚠️ Self-hosted or web✅ Fully managed
Setup Time2-5 min (web)1 click
Data Control✅ Full (if self-hosted)⚠️ On Kimi’s servers
Agent Swarm❌ Hierarchical✅ 100 parallel agents
Open-Source✅ Yes✅ OpenClaw only
Multi-Channel❌ Web/CLI only✅ WhatsApp, Telegram, Slack
Code Execution✅ Full terminal⚠️ Tool-based
Model Flexibility✅ Any model (BYOK)⚠️ Kimi K2.5 primary
Cost✅ Free (local models)~$0-10/month
Customization✅ Source code⚠️ Pre-configured

Use Cases

Individual Developers

  • Full-stack web development automation
  • Code refactoring and modernization
  • Bug investigation and fixing
  • Documentation generation

Researchers

  • Multi-step literature review
  • Hypothesis generation
  • Data analysis and reporting
  • Source triangulation

Content Creators

  • Blog post and article generation
  • Video and image creation
  • Presentation design
  • Documentation writing

Enterprises

  • Custom domain models
  • On-premises deployment
  • Workflow automation
  • Data analysis pipelines

Businesses

  • Market research and competitive analysis
  • Customer analysis and insights
  • Business report generation
  • Automation of repetitive tasks

Governance & Safety

Guardian Lattice

ii.inc implements oversight through:

  • Sentinels: Monitor execution logs
  • Advisers: Publish risk reports
  • Implementers: Emergency pause capability

User Sovereignty

  • Each user controls one agent via cryptographic key
  • No central control or data extraction
  • Alignment learned per individual user
  • Preferences never exposed upstream

Getting Started

Web Platform

  1. Visit agent.ii.inc
  2. Three models ready immediately (Gemini, Claude, GPT-5)
  3. Connect your own API keys (optional)
  4. Start using Plan Mode for complex tasks

Self-Hosted

  1. Clone from GitHub
  2. Choose runtime (Local, Docker, E2B)
  3. Use local models or connect API keys
  4. Deploy and customize

Strengths

Open-Source: Fully auditable, customizable, deployable
User Sovereign: BYOK, no vendor lock-in, data control
Multi-Model: Seamless switching between frontier models
Production-Ready: V1 release with strong benchmarks
Generalist: Excels across research, coding, content, analysis
Extensible: Custom skills, model context protocol support
Free Option: Local models cost nothing
Research Integration: II-Researcher deep research framework
Transparent Governance: Guardian Lattice oversight


Limitations

⚠️ No Agent Swarm: Hierarchical multi-agent only (vs Genspark’s 100 parallel)
⚠️ No Multi-Channel Messaging: Web/CLI only (vs Kimi Claw’s WhatsApp/Telegram)
⚠️ No Voice Calls: Can’t make phone calls (vs Genspark feature)
⚠️ Setup Complexity: Self-hosting requires more than one-click (vs Kimi Claw)
⚠️ Smaller Community: Newer than alternatives
⚠️ Less Brand Recognition: Not as well-known as OpenAI/Anthropic platforms


Ideal User Profile

Best For:

  • Open-source advocates wanting full transparency
  • Developers needing full code execution freedom
  • Privacy-conscious users wanting local deployment
  • Organizations with data residency requirements
  • Cost-optimized deployments (free with local models)
  • Multi-model experimentation (switch freely)
  • Custom automation and extensibility

Less Suitable For:

  • Non-technical users (more setup than Kimi Claw)
  • Those needing phone call capabilities (use Genspark)
  • Those wanting guaranteed 24/7 uptime (use Kimi Claw)
  • Those requiring pre-wired multi-channel messaging

See Also