Projects
Discover outstanding open source projects
OtherProject
DeepScientist
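AI-driven platform for exploring the research frontier, aiming to let stronger AI push the boundaries of science and build a stronger shared future. Supports automated research workflows, paper analysis, and experiment design.
AI research, automated research, paper analysis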
00
OtherProject
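QMD - Local Document Search Engine
Mini CLI semantic search engine for local documents, knowledge bases, and meeting notes. Tracks current state-of-the-art retrieval methods and runs entirely locally with no cloud services. 17k+ stars.
semantic search, CLI, knowledge base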
00
OtherProject
ByteRover CLI
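Portable memory layer for coding agents. Gives autonomous coding agents (such as Claude Code and Codex) persistent memory across sessions, with project context management, task state tracking, and knowledge graph construction.
AI Agent, memory system, coding assistant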
00
OtherProject
TradingView MCP Server
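AI-driven TradingView MCP server with real-time cryptocurrency and stock screening, technical indicator analysis, Bollinger Band analysis, and candlestick pattern recognition. Compatible with AI assistants such as Claude Desktop and supports multiple exchanges including Binance, KuCoin, and Bybit.
MCP, TradingView, cryptocurrency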
00
AI ToolsProject
PaddleOCR
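OCR toolkit supporting 100+ languages that converts PDFs and images into structured data in one step. Lightweight and efficient, it serves as a bridge between documents and LLMs.
ocr, document-ai, pdf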
00
AI AgentProject
ChatDev 2.0
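LLM-based multi-agent collaborative software development framework that simulates the full workflow of a software company, automating the path from requirements to code.
multi-agent, software-dev, collaboration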
00
AI Dev ToolsProject
Oh My Codex (OMX)
Workflow enhancement layer for the OpenAI Codex CLI, adding hooks, agent teams, a HUD, and other extensions to make Codex more capable.
codex, agent, workflow
00
ModelProject
LiteRT-LM - Google On-Device LLM Inference Engine
On-device LLM inference engine by Google AI Edge team, built in C++ for efficient large language model execution on mobile and edge devices.
on-device-ai, llm-inference, edge-ai
01305
ToolProject
FFF.nvim - Blazing Fast File Search for AI Agents
Blazing fast fuzzy file finder for AI agents and Neovim with built-in memory — reduces token usage by intelligently ranking files based on frecency, git status, and definitions. MCP protocol supported.
ai-agent, file-search, mcp
03503
ToolProject
Pi Mono - Full-Stack AI Agent Toolkit
AI agent toolkit by the creator of libGDX: coding agent CLI, unified multi-provider LLM API, TUI & web UI libraries, Slack bot, and vLLM pod management — everything you need to build AI agents.
ai-agent, coding-agent, llm-api
331651
OtherProject
Google LiteRT-LM
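Lightweight on-device LLM inference engine from Google, implemented in C++: a large language model runtime optimized for mobile devices and embedded platforms.
on-device inference, C++, LLM Runtime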
00
OtherProject
Google AI Edge Gallery
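Google's official showcase app for on-device AI models. Runs open models such as Gemma 4 offline on a phone, with fully local inference for privacy and speed.
on-device AI, Gemma, mobile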
00
OtherProject
RAG-Anything
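Next-generation all-modality RAG framework built on LightRAG, providing unified processing and retrieval-augmented generation over multimodal documents containing text, images, tables, formulas, and charts.
RAG, multimodal, document processing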
00
OtherProject
Pi Mono
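All-in-one AI agent development toolkit: coding agent CLI, unified multi-LLM API, terminal and web UI libraries, Slack bot, and vLLM deployment management, for building AI agent applications in one place.
AI Agent, coding agent, LLM API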
00
OtherProject
mngr
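Open-source agent management CLI from Imbue AI for managing and orchestrating multiple AI agents from the terminal. Lightweight by design, suited to developers who need to dispatch agent tasks quickly.
agent management, CLI, orchestration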
10
OtherProject
Qwen-Code
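Open-source terminal AI coding agent from the Qwen (Tongyi Qianwen) team, an open-source alternative to Claude Code. Supports code generation, editing, and debugging in the terminal, built on Qwen models.
qwen, coding agent, terminal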
10
OtherProject
GitHub Copilot SDK
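Official multi-platform SDK from GitHub for integrating Copilot agent capabilities into any application or service. Supports multiple languages and platforms and provides a standardized way to plug in an AI coding assistant.
copilot, SDK, github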
00
OtherProject
Maths-CS-AI Compendium
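Systematic learning path for becoming a top AI/ML research engineer, covering the core of mathematics, computer science, and AI, with interactive content.
AI learning, machine learning, education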
10
OtherProject
Compound Engineering Plugin
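Open-source compound engineering plugin from Every that adds team collaboration features to AI coding tools such as Claude Code and Codex, supporting engineered multi-agent workflows.
Claude Code, coding tools, team collaboration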
00
OtherProject
Last30Days Skill
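AI agent skill that automatically researches any topic across Reddit, X, YouTube, HN, Polymarket, and other platforms, then generates a synthesized summary report. 18k+ stars.
AI Agent, research, information aggregation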
00
OtherProject
Headroom
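Context optimization layer for LLM applications that automatically compresses and manages the context window, reducing token consumption while preserving output quality. Supports multiple compression strategies.
LLM, context optimization, token savings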
00
OtherProject
Compound Engineering Plugin
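Open-source compound engineering plugin from Every, supporting Claude Code and Codex for team-level multi-agent collaborative development workflows.
Claude Code, Codex, multi-agent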
00
OtherProject
fff.nvim
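File search toolkit built for AI agents, billed as the fastest and most accurate. Supports Neovim, Rust, C, and NodeJS so agents can locate code efficiently.
AI Agent, file search, Neovim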
00
OtherProject
Koharu
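ML-based manga translator written in Rust that automatically detects text in speech bubbles and translates it, with multi-language support.
manga translation, ML, Rust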
10
OtherProject
Headroom
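Context optimization layer for LLM applications that intelligently compresses and manages the context window, reducing token consumption while preserving output quality.
LLM, context optimization, token compression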
00
OtherProject
vercel-labs/just-bash
Bash-for-agents tool from Vercel Labs that lets AI agents run Bash commands more safely and efficiently, suited to agent automation scenarios.
bash, agent, vercel
00
OtherProject
microsoft/agent-framework
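Microsoft's open-source framework for building, orchestrating, and deploying AI agents. Supports Python and .NET with multi-agent workflows, suited to enterprise agent applications.
microsoft, agent, framework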
00
OtherProject
oh-my-openagent
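Agent orchestration framework (formerly oh-my-opencode), billed as the best of its kind, with multi-agent collaboration, a plugin system, and a rich set of developer tools. A popular project with 48,000+ stars.
agent, orchestration, open-source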
00
OtherProject
everything-claude-code
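Performance optimization system for coding agents such as Claude Code, Codex, and Cursor, covering skill trees, instincts, memory management, security policies, and a research-first development methodology. Gained 23,000+ GitHub stars this week.
agent, claude-code, optimization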
00
OtherProject
Sim Studio
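Open-source AI agent orchestration platform with a visual interface for building, deploying, and managing agent workflows, positioned as the central intelligence layer for an AI team.
AI Agent, workflow orchestration, visualization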
10
OtherProject
screenpipe
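AI agent framework driven by screen activity. It records and interprets on-screen actions to drive automation agents, so the AI knows what the user is doing and can work on their behalf.
AI Agent, screen recording, automation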
20
OtherProject
tokscale
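CLI for tracking the token usage of AI coding agents. Monitors token consumption for Claude Code, Codex, OpenClaw, Gemini, Cursor, and other mainstream AI coding assistants, with a global leaderboard and 2D/3D contribution graphs.
CLI, token tracking, AI coding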
10
OtherProject
Daft
High-performance AI data engine for large-scale processing of images, audio, video, and structured data, with a Rust core and a Python interface.
data-engine, multimodal, rust
10
OtherProject
MLX-LM
LLM inference tooling for Apple's MLX framework, for running large language models locally and efficiently on a Mac.
local-llm, apple-silicon, mlx
10
OtherProject
CocoIndex
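Data transformation framework for AI with incremental processing and very high performance, suited to building data pipelines for RAG, search, and analytics.
data-pipeline, rag, incremental-processing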
30
OtherProject
agent-browser
Browser automation CLI for AI agents from Vercel Labs that lets agents control a browser directly to complete web tasks.
browser-automation, ai-agent, cli
20
OtherProject
OpenCode
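Open-source AI coding agent available in the terminal and as a desktop app, with a multilingual interface, one-step installation, and compatibility with all mainstream LLM providers. An open-source alternative to Claude Code and Cursor.
coding-agent, terminal, open-source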
10
OtherProject
Oh My OpenAgent
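Open-source agent orchestration tool with 48k stars. Supports mixing models (Claude/Kimi/GLM/GPT/Gemini), automatic task assignment, and discipline enforcement, and is described as a faster coding agent than Claude Code.
coding-agent, multi-model, orchestration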
10
OtherProject
LightRAG
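Implementation of an EMNLP 2025 paper from the University of Hong Kong: a lightweight, efficient RAG framework with knowledge-graph-enhanced retrieval, supporting graph storage, reranking, multimodal data processing, and an OpenSearch backend.
RAG, knowledge-graph, retrieval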
20
OtherProject
Microsoft Agent Framework
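Microsoft's official multi-agent framework for Python and .NET, with enterprise features such as graph-based workflow orchestration, streaming, checkpointing, and human-in-the-loop interaction. Consolidates Semantic Kernel and AutoGen.
multi-agent, microsoft, workflow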
20
OtherProject
Superpowers
Agentic skills framework and software development methodology that gives AI coding agents reusable skill packs.
AI Agent, skills framework, development methodology
10
OtherProject
Repomix
Packs an entire code repository into a single AI-friendly file that can be fed to LLMs such as Claude, ChatGPT, and Gemini for code analysis.
dev tools, code analysis, LLM
10
OtherProject
Sim Studio
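Visual platform for building, deploying, and orchestrating AI agents, with drag-and-drop workflow design. Positioned as the central intelligence and scheduling layer for an AI team.
AI Agent, workflow, visualization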
20
OtherProject
TimesFM
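Time series foundation model from Google Research: a pretrained general-purpose forecasting model that transfers zero-shot to a range of forecasting tasks.
time series, forecasting, foundation model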
20
OtherProject
Onyx
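Open-source AI platform that connects to all mainstream LLMs and provides advanced AI chat, with built-in document retrieval, permission management, and enterprise deployment capabilities.
AI platform, RAG, enterprise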
10
OtherProject
Headroom
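Context optimization layer for LLM applications that manages and compresses prompt context to maximize information density within a limited token budget.
LLM, context optimization, token management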
10
OtherProject
GitHub Copilot SDK
Official multi-platform Copilot agent SDK from GitHub for integrating Copilot capabilities into any application or service.
GitHub Copilot, SDK, AI integration
10
OtherProject
Oh My OpenAgent
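Agent enhancement framework (48k stars), billed as the most powerful of its kind, providing a unified plugin system, team collaboration, and a HUD interface for a range of coding agents.
agent framework, plugin system, dev tools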
10
OtherProject
Just Bash
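Bash tool for agents from Vercel Labs that lets AI agents run Bash commands safely and efficiently, giving agents a standardized way to interact with the terminal.
agent tools, Bash, terminal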
10
OtherProject
mngr
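Open-source AI agent management CLI from Imbue that manages coding agents the way git manages code, with deployment to multiple environments such as SSH, Docker, and Modal.
agent management, CLI tool, multi-agent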
10
OtherProject
Qwen Code
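Open-source terminal AI coding agent from the Qwen (Tongyi Qianwen) team, an open-source alternative to Claude Code that supports local execution and switching between models.
AI Agent, coding assistant, terminal tool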
20
AgentProject
Goose — Open Source Extensible AI Agent by Block
An open source, extensible AI agent by Block that goes beyond code suggestions — install, execute, edit, and test with any LLM
AI Agent, Open Source, Code Assistant
10
AgentProject
Microsoft Agent Framework — Multi-Agent Orchestration
Microsoft official open-source framework for building and orchestrating AI agents and multi-agent workflows with Python and .NET support
Microsoft, Agent, Multi-Agent
28511
ai-modelProject
MLX-VLM — Vision Language Model Inference & Fine-tuning on Mac
Efficiently run and fine-tune Vision Language Models on Mac using Apple MLX framework, supporting multiple mainstream VLM architectures
MLX, VLM, Mac
23362
OtherProject
last30days-skill
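AI agent skill plugin that automatically researches content across Reddit, X, YouTube, HN, Polymarket, and other platforms and generates a grounded, synthesized summary report.
AI-Agent, research tool, information aggregation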
10
OtherProject
headroom
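Context optimization layer for LLM applications that helps developers compress and manage the context window, reducing token consumption while preserving output quality.
context optimization, token management, LLM tools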
20
OtherProject
system_prompts_leaks
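Regularly updated collection of leaked system prompts from mainstream AI products including ChatGPT, Claude, Gemini, and Grok. A useful resource for studying how leading AI products design their prompts.
system prompts, prompt-engineering, AI security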
20
OtherProject
Maths-CS-AI Compendium
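Systematic learning path for AI/ML research engineers covering mathematical foundations, core computer science, and frontier AI, aimed at learners growing from scratch into top AI research engineers.
ai, learning, ml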
30
OtherProject
AIChat
All-in-one LLM command-line tool combining a shell assistant, chat REPL, RAG, AI tools, and agents, with support for mainstream models including OpenAI, Claude, Gemini, and Ollama.
llm, cli, rag
30
OtherProject
Koharu
ML-based manga translation tool written in Rust, with automatic speech-bubble text detection, OCR, and translated text rendering, optimized for manga.
manga, translation, ml
30
OtherProject
ForgeCode
AI programming assistant built in Rust that supports 300+ models including Claude, GPT, Gemini, and DeepSeek for pair programming in the terminal.
ai, coding-assistant, rust
40
Dev ToolsProject
Compound Engineering Plugin
Compound engineering plugin developed by Every for AI coding tools such as Claude Code and Codex, enabling engineered multi-agent collaboration.
claude-code, codex, engineering
30
AI DevelopmentProject
Honcho
Memory management library for AI agents that provides persistent memory and personalized interaction for building stateful agents.
agent-memory, stateful-agent, personalization
30
AI ToolsProject
Chandra OCR
High-accuracy OCR model that handles complex tables, forms, and handwriting, with full layout reconstruction.
ocr, document-ai, table-extraction
30
ProductivityProject
OpenScreen
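Free, open-source screen recording and demo tool, an open-source alternative to Screen Studio. No subscription, no watermark, and commercial use is allowed.
screen-recording, demo, open-source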
30
OtherProject
Skill Seekers
Automatically converts documentation sites, GitHub repositories, and PDFs into Claude AI skill packs, with conflict detection.
Claude, skills, automation
40
OtherProject
Oumi
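All-in-one open-source platform for fine-tuning, evaluating, and deploying LLMs, supporting mainstream open models such as Qwen3 and DeepSeek-R1.
fine-tuning, LLM, deployment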
50
OtherProject
PraisonAI
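Your 24/7 AI workforce: a low-code multi-agent AI platform with automatic planning, research, and coding, able to deliver results to Telegram, Discord, and WhatsApp.
multi-agent, low-code, automation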
50
OtherProject
Microsoft APM
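Agent package manager from Microsoft that gives the AI agent ecosystem a standardized way to discover, install, and version skill packs.
Agent, package management, Microsoft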
40
OtherProject
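Local Deep Research - Locally Run Deep Research Engine
Deep research tool that runs locally, reaching about 95% accuracy on the SimpleQA benchmark. Supports local models such as Ollama, searches 10+ data sources including arXiv and PubMed, and keeps everything local and encrypted.
deep research, local deployment, RAG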
40
OtherProject
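MLX-VLM - Local Vision Language Models on Mac
Run and fine-tune vision language models (VLMs) on a Mac with Apple's MLX framework. Supports mainstream open VLMs, with local inference and no GPU server required.
VLM, Apple MLX, Mac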
40
OtherProject
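FFF.nvim - Fast File Search for AI Agents
Fast file search tool designed for AI coding agents, with built-in memory to reduce token consumption. Supports the MCP protocol and integrates with Claude Code, Codex, and others.
AI Agent, file search, MCP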
60
OtherProject
Microsoft APM (Agent Package Manager)
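AI agent package manager from Microsoft for managing and distributing agent skill packs the way npm manages packages. Supports discovering, installing, and versioning agent tools.
AI Agent, package management, Microsoft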
40
OtherProject
Awesome Claude Code (Community Edition)
Community-maintained curated list of Claude Code skills, hooks, slash commands, and plugins. 36k+ stars.
Claude Code, awesome-list, agent skills
40
OtherProject
Cursor2API
Proxy service that converts Cursor's free API into OpenAI/Anthropic-compatible formats, with support for Claude Code tool calls and images.
API proxy, Cursor, OpenAI-compatible
40
OtherProject
Supervision
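Open-source computer vision library from Roboflow providing reusable components for detection, tracking, annotation, and more. 37k+ stars.
computer vision, object detection, Python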
40
OtherProject
Chandra OCR
High-accuracy OCR model from Datalab that handles complex tables, forms, and handwriting with full layout reconstruction. Gained 2.4k stars this week.
ocr, document-ai, table-extraction
50
OtherProject
AI/ML Research Engineer Compendium
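Complete learning roadmap for becoming a top AI/ML research engineer, covering the core of mathematics, computer science, and artificial intelligence. 2.8k stars.
learning, ai, machine-learning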
50
OtherProject
Cloudflare Temp Email
Free temporary domain mailbox built on Cloudflare, supporting sending and receiving mail, attachments, IMAP/SMTP, and Telegram bot integration. 7.9k stars.
cloudflare, email, self-hosted
50
OtherProject
Compound Engineering Plugin
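Compound engineering plugin for Claude Code and Codex from Every, supporting multi-agent team collaboration, automatic task decomposition, and quality feedback loops. Gained 1.4k+ stars this week.
agent, claude-code, codex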
40
OtherProject
Strix
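Open-source AI security hacking tool that automatically finds and fixes application vulnerabilities. A popular security project with 23k+ stars.
security, AI, vulnerability scanning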
40
OtherProject
Honcho
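Open-source library for giving AI agents stateful memory, so agents can remember context, user preferences, and past interactions.
agent, memory, development framework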
50
OtherProject
OpenScreen
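Open-source screen recorder for creating polished demo videos, with no watermark and free commercial use. An open-source alternative to Screen Studio.
tools, screen recording, open source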
80
OtherProject
Oh My Codex
Enhancement framework for the OpenAI Codex CLI, adding hooks, agent teams, a HUD panel, and other extensions so Codex no longer has to work alone.
agent, codex, cli
50
AIProject
TimesFM
Time series foundation model from Google Research, designed for time series forecasting. Gained 1,100+ GitHub stars today.
time series, forecasting, Google
50
AIProject
VibeVoice
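Microsoft's open-source frontier voice AI model, supporting speech synthesis, speech recognition, and other speech tasks. Gained 1,000+ GitHub stars today.
voice AI, Microsoft, TTS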
60
ToolProject
Dimillian Skills
A collection of Codex skills shared by renowned iOS developer Dimillian, demonstrating how to build practical skill modules for AI coding assistants using Shell scripts.
codex, skills, shell
52659
ToolProject
Claude Code Best Practice
A visual, example-driven guide to Claude Code — from basic concepts to advanced agents, with copy-paste templates for subagents, commands, skills, and orchestration workflows.
claude-code, best-practice, dev-tools
50
AgentProject
Agent Lightning
Microsoft open-source framework to train and optimize AI agents with zero code change. Supports all major agent frameworks with RL, automatic prompt optimization, and SFT algorithms.
agent-training, reinforcement-learning, microsoft
70
OtherProject
Ruflo
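Multi-agent orchestration platform for Claude, supporting intelligent swarm deployment, autonomous workflow coordination, and RAG integration on an enterprise-grade architecture. 28k+ stars, trending this week.
agent orchestration, multi-agent, Claude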
40
OtherProject
TaxHacker
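Self-hosted AI accounting app that uses LLMs to automatically analyze receipts, invoices, and transactions, with custom categorization rules and multi-language support. 3k+ stars.
finance, self-hosted, AI accounting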
50
OtherProject
Honcho
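Open-source library for giving AI agents stateful memory, supporting long-term memory, user profiles, and personalized conversation management. Lightweight and easy to integrate.
agent memory, state management, Python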
70
OtherProject
Social Auto Upload
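Automates video uploads to social media platforms (Douyin, Xiaohongshu, WeChat Channels, TikTok, YouTube, Bilibili), with scheduled publishing and batch operations.
automation, video publishing, multi-platform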
50
OtherProject
Chandra
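High-accuracy OCR model specialized for complex tables, forms, and handwriting, with full layout reconstruction. From the authors of Surya and Marker. 8k+ stars.
OCR, document processing, table recognition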
70
Dev ToolsProject
Fastfetch
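High-performance system information tool, a modern replacement for neofetch. Written in C, with support for Linux, macOS, Windows, and other platforms.
system info, CLI, neofetch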
60
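Security ToolsProject
Sherlock
Username search tool covering 400+ social platforms: enter a username to find matching accounts across social networks.
OSINT, social media, username search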
50
AI ToolsProject
Claude How To
Visual, hands-on guide to Claude Code, from basic concepts to advanced agent orchestration, with copy-paste templates. Get started in 15 minutes.
claude-code, tutorial, agent
60
OtherProject
last30days-skill
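AI agent skill plugin that researches the last 30 days of activity on any topic across Reddit, X, YouTube, HN, Polymarket, and other platforms and generates a well-sourced, synthesized summary. Gained 10k+ GitHub stars this week.
agent, research, skill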
50
OtherProject
claude-subconscious
Subconscious system for Claude Code developed by Letta AI. Adds a persistent memory layer to Claude Code so it keeps context across sessions.
Claude Code, memory, dev tools
40
OtherProject
supermemory
Memory engine for the AI era, offering a fast, scalable Memory API. Adds long-term memory to any AI application, with semantic search and context management.
memory, API, AI
70
OtherProject
AI-Scientist-v2
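AI Scientist v2 from SakanaAI, which uses agentic tree search to carry out workshop-level scientific discovery automatically. Covers the full pipeline: literature review, hypothesis generation, experiment design and execution, and paper writing.
AI, scientific research, automation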
50
AIProject
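VibeVoice - Microsoft Open-Source Voice AI
Open-source frontier voice AI from Microsoft, supporting high-quality speech synthesis and speech understanding. Trending on GitHub.
voice AI, Microsoft, TTS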
60
AIProject
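OpenBB - Open-Source Financial Data Platform
Modern financial data platform for analysts, quants, and AI agents, supporting integration and analysis across multiple data sources.
financial data, quantitative analysis, AI Agent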
70
AIProject
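Airi - AI Virtual Companion Framework
Self-hosted AI virtual companion container with real-time voice chat and the ability to play Minecraft and Factorio alongside the user. Runs on web, macOS, and Windows. A popular open-source project with 36k+ stars.
AI companion, voice chat, virtual character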
70
OtherProject
Agentation
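Visual feedback tool for AI agents that shows an agent's execution, decision paths, and tool call chains in real time, making agent behavior transparent and observable.
agent, visualization, observability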
70
OtherProject
Claude Agent SDK Python
Python version of Anthropic's official Claude Agent SDK, a standardized framework for building AI agents with tool calling, multi-turn conversations, and agent orchestration.
claude, agent-sdk, anthropic
90
OtherProject
Strix
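Open-source AI security hacking tool that automatically finds and fixes security vulnerabilities in applications, using AI to simulate real attacker behavior for automated penetration testing.
security, pentest, ai-hacker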
80
OtherProject
Chandra OCR
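Next-generation OCR model that handles complex tables, forms, and handwritten content while fully preserving document layout.
OCR, document parsing, table recognition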
60
OtherProject
Twenty CRM
Modern open-source CRM platform, a community-driven alternative to Salesforce built full-stack in TypeScript. 42k+ stars.
CRM, open source, TypeScript
70
OtherProject
Onyx
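Open-source AI chat platform that connects to all mainstream LLMs, with a built-in RAG knowledge base, document parsing, and permission management. Suited to enterprise AI assistant deployments.
AI platform, RAG, enterprise AI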
100
OtherProject
insanely-fast-whisper
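Very fast Whisper speech-to-text tool optimized on Hugging Face Transformers, reported to be tens of times faster than the original, with batch transcription of large files.
speech recognition, Whisper, transcription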
100
OtherProject
Deep-Live-Cam
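Real-time face swap and one-click video deepfake tool that needs only a single photo for live face replacement, supporting multiple AI models and live camera processing.
deepfake, face swap, real-time processing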
100
AIProject
VibeVoice
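Microsoft's open-source frontier voice AI framework. Transcribes 60-minute audio in a single pass (with speakers and timestamps), offers real-time TTS and 50+ languages, and is integrated into Hugging Face Transformers.
voice AI, TTS, ASR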
100
AIProject
Dexter
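Autonomous financial research agent that breaks down complex financial questions, pulls real-time market data, and self-verifies its results. Like Claude Code, but built for financial research.
financial analysis, Agent, automated research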
100
AIProject
AI Scientist v2
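Automated AI research system, v2: with no human-written templates, it generates hypotheses, runs experiments, and writes papers on its own. Its first fully AI-generated paper was accepted at an ICLR workshop.
AI research, automation, paper generation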
90
OtherProject
chrome-devtools-mcp
Official DevTools MCP server from Chrome that lets AI coding assistants drive the browser's developer tools directly for automated debugging.
Chrome, DevTools, MCP
100
OtherProject
TradingAgents-CN
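Chinese-language financial trading framework based on multi-agent LLMs: an enhanced Chinese edition of TradingAgents with support for A-share market analysis and trading strategies.
multi-agent, financial trading, Chinese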
110
OtherProject
claude-subconscious
Adds a "subconscious" memory layer to Claude Code so the AI coding assistant keeps context across sessions. Developed by the Letta AI team.
Claude Code, memory, context
140
OtherProject
claude-plugins-official
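High-quality Claude Code plugin directory officially maintained by Anthropic, offering a vetted set of plugins and a good reference for learning plugin development.
Claude Code, plugins, Anthropic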
120
OtherProject
vllm-omni
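vLLM's multimodal inference framework for efficient serving of omni-modal models across text, images, audio, and more, described as the next evolution of vLLM.
multimodal, LLM inference, vLLM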
100
ai-agentProject
agentscope
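Visual platform for building and running agents, supporting observable, understandable, and trustworthy agent development and lowering the barrier to entry.
agent-framework, visual-builder, observability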
110
ai-modelProject
chandra
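High-accuracy OCR model that handles complex tables, forms, and handwriting with full layout reconstruction, useful for document digitization.
ocr, document-ai, table-recognition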
100
ai-agentProject
oh-my-claudecode
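Multi-agent orchestration framework for Claude Code with team-level task assignment and parallel execution, letting multiple Claude Code instances work together on complex projects.
claude-code, multi-agent, orchestration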
130
OtherProject
MoneyPrinterTurbo
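Open-source tool that uses large AI models to generate HD short videos in one click. Automatically generates the video script, voiceover, subtitles, and background music, suited to batch content creation. A popular Python project with 53,000+ stars.
AI video, short video, automation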
110
OtherProject
OpenDataLoader PDF
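AI-ready PDF parser that automates PDF data extraction and accessibility processing. An open-source Java project focused on converting PDFs into structured data for AI training, with intelligent parsing of tables, charts, and complex layouts. Trending on GitHub this week with 9,400+ stars.
PDF, data extraction, AI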
120
ToolProject
Pascal Editor - 3D Architectural Project Editor
Create and share 3D architectural projects. An open-source editor built with TypeScript for collaborative design.
3D, architecture, editor
126432
ai-agentProject
Last30Days - AI Topic Research Skill
AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary.
research, agent-skill, topic-analysis
116262
ai-agentProject
Ruflo - Agent Orchestration Platform for Claude
The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows with enterprise-grade architecture, distributed swarm intelligence, RAG integration, and native Claude Code / Codex integration.
multi-agent, orchestration, claude
2725590
OtherProject
Claude Plugins Official
High-quality Claude Code plugin directory officially maintained by Anthropic, offering a vetted set of plugins that extend what Claude Code can do.
Claude Code, plugins, Anthropic
120
OtherProject
DeepAgents
Official agent framework from LangChain with built-in planning tools, a filesystem backend, and sub-agent spawning, suited to complex multi-step tasks.
Agent, LangChain, LangGraph
180
OtherProject
Claude HUD
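Real-time monitoring plugin for Claude Code that visualizes context usage, active tools, sub-agent status, and task progress. Aimed at developers.
Claude Code, dev tools, plugin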
130
OtherProject
Project NOMAD
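Offline survival AI computer that bundles critical tools, a knowledge base, and AI capabilities to provide information and decision support without a network connection. Gained 12,000+ GitHub stars this week.
AI, offline, survival tools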
140
AgentProject
Last30Days Skill
AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web, then synthesizes a grounded summary
agent skill, research, multi-platform
154862
ToolProject
Pascal Editor
Open-source editor for creating and sharing 3D architectural projects, built for collaborative design.
editor, collaboration, open-source
144143
ToolProject
Supermemory
Memory engine and app that is extremely fast and scalable. The Memory API for the AI era.
memory engine, API, AI
1618187
OtherProject
dimos
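Agentic operating system (Agentic OS) for physical spaces. Supports natural-language control of hardware platforms such as humanoid robots, quadrupeds, and drones, for building multi-agent systems that work with physical sensors and hardware (cameras, lidar, actuators).
robotics, agent-os, hardware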
120
OtherProject
next-ai-draw-io
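Next.js-based AI diagramming app that integrates AI with draw.io diagrams. Supports creating, modifying, and enhancing diagrams through natural-language commands, speeding up work on architecture diagrams and flowcharts.
draw.io, AI, diagram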
230
OtherProject
GitNexus
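Zero-server code intelligence engine: a knowledge graph builder that runs entirely client-side in the browser. Drop in a GitHub repository or ZIP file to generate an interactive code knowledge graph, with a built-in Graph RAG agent for code exploration.
knowledge-graph, code-analysis, browser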
120
OtherProject
MiroThinker
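Deep research agent from MiroMindAI, optimized for complex research and prediction tasks. The latest model, MiroThinker-H1, scores 88.2 on the BrowseComp benchmark and has strong web browsing and information synthesis capabilities.
deep-research, agent, prediction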
140
ToolProject
n8n-mcp - Build n8n Workflows with AI via MCP
An MCP server for Claude Code, Cursor, and Windsurf that builds n8n workflows from natural language.
mcp, n8n, automation
1115830
ModelProject
minimind - Train a 26M GPT from Scratch in 2 Hours
Train a 26M-parameter GPT from scratch in just 2 hours with full pretraining, SFT, and RLHF pipeline.
llm, training, education
150
AgentProject
DeerFlow - ByteDance SuperAgent Framework
An open-source SuperAgent harness by ByteDance that researches, codes, and creates with sandboxes, memories, tools and subagents.
agent, multi-agent, research
3137318
OtherProject
Unsloth
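Unified open-source web UI for model training and inference, supporting local fine-tuning and running of mainstream open models such as Qwen, DeepSeek, and Gemma.
model training, fine-tuning, open models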
170
OtherProject
Learn Claude Code
Build a Claude Code-style agentic coding tool from scratch to understand the core principles of AI agents: Bash is all you need.
Agent, tutorial, Claude Code
150
OtherProject
Everything Claude Code
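Performance optimization system for AI coding assistants, integrating Skills, Instincts, Memory, Security, and other modules. Works with mainstream AI coding tools such as Claude Code, Codex, and Cursor.
Claude Code, AI coding, Agent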
140
AgentProject
TradingAgents - Multi-Agent Financial Trading Framework
Multi-agent LLM financial trading framework simulating real trading firm workflows with analysts, researchers and traders
finance, trading, multi-agent
4435970
ToolProject
Project NOMAD - Offline Survival AI Computer
Self-contained offline survival computer packed with critical tools, knowledge, and AI for anytime, anywhere use
offline-AI, survival, knowledge-base
167718
ToolProject
PentaGI - Autonomous AI Penetration Testing
Fully autonomous AI agent system for complex penetration testing tasks with multi-agent collaboration
security, penetration-testing, AI agent
165200
OtherProject
Context Hub
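Open-source project from Andrew Ng's team focused on context management and organization for AI agents, helping developers build and manage an agent's knowledge and tools.
context, agent, andrew-ng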
120
OtherProject
OpenViking
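Context database for AI agents open-sourced by ByteDance. Uses a filesystem paradigm to unify management of the memories, resources, and skills an agent needs, with hierarchical context delivery and self-evolution.
context, memory, agent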
150
OtherProject
DeepAgents
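Official agent framework from LangChain, built on LangGraph, with built-in planning tools, a filesystem backend, and sub-agent spawning. Suited to complex multi-step agent tasks.
langchain, agent, planning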
120
OtherProject
Superpowers
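Agent skills framework and software development methodology for building AI agents with specialized capabilities. With more than 100k stars, it is one of the most popular open-source agent skills projects.
agent, skills, framework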
130
AI ToolProject
MoneyPrinterV2
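Toolkit for automating online money-making workflows, integrating content generation, publishing, and monetization into one pipeline.
automation, making money, content generation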
170
AI ToolProject
vLLM Omni
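Omni-modal model inference framework from the vLLM team, supporting efficient inference with multimodal inputs and outputs such as text, images, and audio.
multimodal, LLM inference, vLLM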
130
AI ToolProject
Project NOMAD
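Offline survival computer that bundles local AI, critical knowledge bases, and practical tools to provide information retrieval and AI assistance without a network connection.
offline AI, survival tools, local LLM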
160
OtherProject
Context Hub
Context management hub from Andrew Ng's team that helps AI agents manage and retrieve context efficiently.
context, memory, agent
150
OtherProject
DeepAgents
Deep agent framework from LangChain with built-in planning tools, a filesystem backend, and sub-agent spawning for handling complex tasks.
agent, langchain, planning
150
OtherProject
Lightpanda Browser
Headless browser designed for AI and automation, written in Zig for high performance. Suited to large-scale web scraping and automation tasks.
browser, automation, headless
150
OtherProject
Open-SWE
Open-source asynchronous coding agent from LangChain that plans autonomously and modifies code, and can serve as an alternative to SWE-bench.
agent, coding, langchain
150
AIProject
TradingAgents
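Multi-agent LLM financial trading framework that simulates the multi-role collaboration of a real trading firm to make investment decisions.
trading, multi-agent, finance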
190
AIProject
TaxHacker
Self-hosted AI accounting app that uses LLMs to analyze receipts, invoices, and transactions, with custom prompts and categories.
accounting, self-hosted, llm
200
AIProject
Newton Physics
Open-source GPU-accelerated physics simulation engine built on NVIDIA Warp, designed for roboticists and simulation researchers.
physics, simulation, robotics
170
OtherProject
context-hub
Context management tool from Andrew Ng's team that helps AI agents manage and retrieve project context efficiently.
context, memory, agent
190
OtherProject
open-swe
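Open-source asynchronous coding agent from LangChain that autonomously handles software engineering tasks such as code changes and bug fixes.
coding-agent, langchain, swe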
240
OtherProject
superpowers
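Agent skills framework and software development methodology that provides reusable skill packs and best practices for AI coding tools such as Claude Code.
agent, skills, claude-code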
210
OtherProject
agency-agents
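Complete AI agent team framework covering specialized roles such as frontend development, community operations, and creative input. Each agent has its own personality, workflow, and deliverables.
agent, framework, multi-agent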
210
OtherProject
Arnis - Real-world Minecraft Generator
A Rust-written tool that can accurately recreate any real-world geographic location in Minecraft. Based on OpenStreetMap data, 10k+ Stars.
Minecraft, map generation, Rust
230
AIProject
Get Shit Done (GSD)
A lightweight meta-prompt and context engineering framework for Claude Code, a specification-driven development system. 35k+ Stars, a phenomenal project in AI programming.
Claude Code, prompt engineering, context engineering
190
AIProject
OpenDataLoader PDF
An AI-ready PDF parser for automated PDF accessibility processing. Supports intelligent table recognition, multilingual OCR, and structured data output. Open source and free, currently #1 on GitHub trending.
PDF, AI, data processing
190
OtherProject
DeepAgents
An Agent coding framework created by the LangChain team, featuring built-in planning tools, filesystem backend, and sub-agent generation capabilities, suitable for handling complex multi-step Agent tasks. 15k+ stars.
agent, langchain, coding
190
OtherProject
OpenViking
An AI Agent context database open-sourced by ByteDance, unifying the management of Agent-required memories, resources, and skills through a filesystem paradigm, supporting hierarchical context and self-evolution. 15k+ stars.
agent, context-management, memory
170
OtherProject
Lightpanda Browser
A headless browser specifically designed for AI and automation, written in Zig for extreme performance. Suitable for Agent web interactions, data collection, and automated testing. 21k+ stars.
headless-browser, automation, AI
190
OtherProject
BitNet
Microsoft's official 1-bit LLM inference framework, enabling efficient inference for ultra-low-bit quantized models, significantly reducing deployment costs and hardware requirements for large models. 35k+ stars.
LLM, quantization, inference
200
OtherProject
MiroFish
A concise and versatile swarm intelligence prediction engine that aims to predict anything using collective wisdom. Based on swarm intelligence principles and supporting various prediction scenarios. Written in Python, with 34k+ stars.
swarm-intelligence, prediction, python
200
OtherProject
Hindsight
An Agent memory system that can automatically learn and evolve from interactions. Open-sourced by Vectorize, helping AI Agents establish persistent, retrievable memory capabilities.
Agent memory, self-evolution, vector storage
180
OtherProject
MiroFish MicroFish
A concise and versatile swarm intelligence engine capable of predicting everything. Based on swarm intelligence algorithms, suitable for trend prediction and multi-agent collaborative decision-making scenarios.
swarm intelligence, prediction, multi-agent
200
OtherProject
Promptfoo
A testing framework for Prompt/Agent/RAG, supporting red team testing and security scanning, capable of comparing performance across models like GPT, Claude, Gemini, with CLI and CI/CD integration support.
Prompt testing, red team testing, AI security
200
OtherProject
Lightpanda Browser
A headless browser specifically designed for AI and automation, written in Zig language, faster and lighter than traditional headless browsers, suitable for AI Agent web interaction scenarios.
headless browser, AI automation, Zig
200
OtherProject
OpenViking
An AI Agent context database open-sourced by Volcano Engine, unifying the management of agent-required memories, resources, and skills through a filesystem paradigm, supporting hierarchical context delivery and self-evolution.
AI Agent, context management, memory system
210
LearningProject
Build Your Own X
Implement your favorite technologies from scratch! A curated collection of tutorials covering dozens of areas including 3D renderers, databases, compilers, operating systems, and more - the best practice for Feynman learning method.
learning, tutorial, programming
49350000
ToolProject
Claude HUD
A real-time status panel plugin for Claude Code, displaying context usage, tool activity, sub-agent status, and task progress without requiring additional windows.
claude-code, plugin, developer-tools
255135
FrameworkProject
Deep Agents
An Agent framework by LangChain, ready out-of-the-box. Built-in planning, filesystem, shell, and sub-Agent capabilities, supporting MCP protocol extensions.
agent, langchain, python
2813521
OtherProject
promptfoo
An LLM application evaluation and red team testing CLI tool, supporting automated prompt evaluation, model comparison, security vulnerability scanning, and CI/CD pipeline integration.
evaluation, red team testing, Prompt testing
210
OtherProject
OpenViking
An AI Agent context database open-sourced by ByteDance's Volcano Engine, unifying the management of Agent memories, resources, and skills through a filesystem paradigm, supporting hierarchical context loading and visual retrieval.
context management, Agent memory, Volcano Engine
190
OtherProject
PageAgent
An in-page GUI Agent open-sourced by Alibaba, controlling web interfaces with natural language without browser extensions or screenshots. Pure JS implementation, enabling quick integration of AI Copilot into products.
GUI Agent, web automation, Alibaba
220
OtherProject
MiroFish
A swarm intelligence prediction engine based on multi-agent technology, simulating society and predicting future trends through high-fidelity digital worlds, supporting scenarios like public opinion analysis and financial forecasting.
multi-agent, prediction, swarm intelligence
220
AIProject
OpenViking - AI Agent Context Database
An AI Agent context database open-sourced by ByteDance, unifying the management of Agent memories, resources, and skills through a filesystem paradigm, supporting hierarchical context transfer and self-evolution.
agent, context, memory
250
AIProject
claude-mem - Claude Code Cross-Session Memory Plugin
Automatically captures all operations in Claude Code programming sessions, compresses and summarizes them with AI, and injects them into future sessions, giving Claude cross-session memory capabilities. TypeScript implementation, 36k stars.
claude-code, memory, plugin
240
AIProject
MiroFish - Swarm Intelligence Prediction Engine
A simple and versatile swarm intelligence engine that predicts everything using Swarm Intelligence. Python implementation, supports various prediction scenarios, and is easy to extend. 3200+ GitHub stars today.
swarm-intelligence, prediction, python
250
OtherProject
promptfoo
An LLM application evaluation and red team testing tool, supporting automated prompt testing, model comparison, security vulnerability scanning, and CI/CD integration, compatible with mainstream model providers like OpenAI, Anthropic, and Ollama.
Prompt testing, red team, evaluation
200
OtherProject
nanochat
A minimalist LLM training framework created by Karpathy, enabling the full workflow from tokenization to pre-training, fine-tuning, evaluation, and inference on a single GPU node. Train a GPT-2 level model for just $48 and interact with it via a ChatGPT-style Web UI.
LLM training, GPT, Karpathy
330
OtherProject
OpenViking
An AI Agent context database open-sourced by ByteDance, using a filesystem paradigm to unify the management of Agent memories, resources, and skills, supporting hierarchical context loading and automatic session management, significantly reducing token consumption.
Agent, context management, RAG
210
OtherProject
MiroFish
A swarm intelligence prediction engine based on multi-agent technology, simulating thousands of intelligent agents with independent personalities and memories in a high-fidelity parallel digital world to achieve accurate future trend predictions.
multi-agent, prediction, swarm intelligence
210
FrameworkProject
OpenViking Context Database
An AI Agent context database open-sourced by ByteDance, unifying the management of Agent memories, resources, and skills through a filesystem paradigm, supporting hierarchical context distribution and self-evolution.
agent, memory, context
210
FrameworkProject
MiroFish Swarm Intelligence Engine
A simple and versatile swarm intelligence engine capable of predicting anything. A prediction framework based on Swarm Intelligence, suitable for financial, weather, trend, and other prediction scenarios.
swarm-intelligence, prediction, AI
200
ToolProject
Lightpanda Browser
A headless browser designed for AI and automation, written in Zig, with extremely high performance. Ideal for building AI-driven web scraping, testing, and automation workflows.
browser, headless, automation
260
OtherProject
OpenViking
An AI Agent context database open-sourced by ByteDance's Volcano Engine, unifying the management of Agent memories, resources, and skills through a filesystem paradigm.
Agent, context management, ByteDance
200
OtherProject
promptfoo
Prompt/Agent/RAG testing and red team evaluation tool, supporting performance comparison and security scanning for mainstream models like GPT, Claude, Gemini, and Llama.
Prompt testing, red team, evaluation
220
OtherProject
BitNet
Microsoft's official 1-bit LLM inference framework, significantly reducing deployment costs and hardware requirements for large models, enabling efficient LLM operation on edge devices.
LLM, inference optimization, Microsoft
210
OtherProject
MiroFish
A simple and versatile swarm intelligence prediction engine, using Swarm Intelligence for predictive analysis of various events, with 17k+ GitHub stars this week.
swarm intelligence, prediction, AI
200
ToolProject
Lightpanda Browser
A headless browser designed for AI and automation, written in Zig, extremely lightweight and high-performance, ideal for crawlers and Agent web interaction scenarios.
headless browser, automation, crawler
10416325
FrameworkProject
OpenRAG
A one-stop RAG platform built on Langflow, Docling, and OpenSearch, offering full retrieval-augmented generation capabilities with a single package deployment.
RAG, knowledge base, retrieval-augmented
252483
FrameworkProject
OpenViking
An AI Agent context database open-sourced by ByteDance, unifying the management of Agent memories, resources, and skills through a filesystem paradigm, supporting hierarchical context distribution and self-evolution.
AI Agent, context management, memory system
229556
OtherProject
notebooklm-py
Unofficial Python API for Google NotebookLM, supporting batch importing source files, generating audio podcasts/videos/mind maps, and providing advanced features not exposed in the Web UI.
NotebookLM, Python, automation
190
OtherProject
promptfoo
LLM application evaluation and red team testing tool. Supports automated prompt evaluation, model comparison, security vulnerability scanning, and integrates with CI/CD pipelines.
Prompt testing, red team, evaluation
250
OtherProject
Hermes Agent
A self-evolving AI Agent developed by Nous Research, featuring built-in learning loops. Supports multiple platforms like Telegram/Discord/Slack, runs on $5 VPS, and is compatible with 200+ models.
AI Agent, self-evolving, multi-platform
200
OtherProject
BettaFish Micro-Opinion
From-scratch multi-agent public opinion analysis system covering 30+ global social platforms, automatically collecting and analyzing millions of comments. Features AI-driven monitoring, multimodal analysis, and Agent forum collaboration mechanisms.
opinion-analysismulti-agentdata-analysis
210
OtherProject
MiroFish
Swarm intelligence prediction engine using multi-agent technology to simulate future scenarios via high-fidelity digital worlds. Upload seed materials, describe prediction needs in natural language, and receive detailed reports with interactive digital sandboxes.
multi-agentpredictionswarm-intelligence
220
ToolProject
Hindsight
Agent memory system enabling continuous learning and knowledge extraction from interactions, allowing AI Agents to reuse accumulated experience.
Agentmemorylearning
213362
FrameworkProject
OpenRAG
One-stop RAG platform built with Langflow, Docling, and OpenSearch, offering out-of-the-box retrieval-augmented generation solutions.
RAGsearchLangFlow
211925
FrameworkProject
BitNet
Microsoft's official 1-bit LLM inference framework, drastically reducing deployment costs and hardware requirements while enabling efficient large-model execution on CPUs.
LLMinference-optimizationmicrosoft
4933231
OtherProject
notebooklm-py
Unofficial Python SDK for Google NotebookLM, supporting CLI and AI Agent integration for programmatic access to all NotebookLM features.
notebooklmgooglepython
220
OtherProject
Qwen-Agent
Official Agent framework for Alibaba's Qwen, supporting Function Calling, MCP protocol, code interpreter, RAG, and Chrome extensions.
agentqwenmcp
180
OtherProject
Hermes Agent
Intelligent Agent framework by NousResearch that evolves through usage, supporting tool calls and multi-turn conversations.
agentnous-researchpython
210
OtherProject
MiroFish
Versatile swarm intelligence engine for universal prediction. Uses swarm algorithms for diverse prediction scenarios.
swarm-intelligencepredictionpython
250
AIProject
MiroFish
Swarm intelligence prediction engine using Swarm Intelligence for universal forecasting. Simple yet powerful, trending on GitHub with nearly 3000 stars today.
swarm-intelligencepredictionai
200
AgentProject
Hindsight
Agent memory system by Vectorize that continuously learns from interactions, giving AI Agents growing long-term memory.
agentmemoryrag
230
AgentProject
Page Agent
Alibaba's open-source in-page GUI Agent for controlling web interfaces via natural language. Supports smart in-page operations, trending on GitHub with 1200+ stars today.
agentbrowser-automationalibaba
330
OtherProject
Hermes Agent
AI Agent by NousResearch that grows with you, featuring adaptive learning and skill expansion, starred 5k+ times.
AIAgentNousResearch
210
OtherProject
notebooklm-py
Unofficial Python API for Google NotebookLM, enabling programmatic access to all features (including Web UI-exclusive ones) and integration with Claude Code, Codex, and other AI Agents.
AINotebookLMAPI
200
OtherProject
Qwen-Agent
Official Agent framework for Alibaba's Qwen, supporting Function Calling, MCP, code interpreter, RAG, Chrome extensions, etc. Built on Qwen 3.0+.
AIAgentLLM
210
OtherProject
BettaFish Micro-Opinion
Multi-agent public opinion analysis tool for everyone, breaking information bubbles to reveal truth and predict trends. Built from scratch (no frameworks), starred 38k+ times.
AIAgentopinion-analysis
170
OtherProject
MiroFish
Versatile swarm intelligence engine for universal prediction. Uses swarm algorithms for cross-domain predictive analysis, starred 16k+ times.
AIswarm-intelligenceprediction
170
ToolProject
promptfoo - Prompt Evaluation & Red Teaming
CLI tool for evaluating and red-teaming LLM applications. Automates prompt testing, model comparison, security vulnerability scans, and integrates with CI/CD. MIT-licensed, supports production apps with millions of users.
prompt-testingred-teamevaluation
3912282
FrameworkProject
PageAgent - In-Page GUI Agent
Alibaba's open-source in-page GUI Agent that controls web interfaces via natural language. No browser extensions or screenshots needed—pure JavaScript implementation adds AI copilot to products with just a few lines of code.
gui-agentweb-automationalibaba
214249
ToolProject
MiroFish - Swarm Intelligence Prediction Engine
AI prediction engine using multi-agent technology to build high-fidelity parallel digital worlds where thousands of agents interact and evolve socially for accurate future projections. Supports financial signals, public opinion, novel endings, and more.
multi-agentpredictionsimulation
3315948
OtherProject
hermes-agent
A self-evolving AI Agent by Nous Research, featuring a learning loop that creates skills from experience, auto-improves, retrieves cross-session memories, and models users. Runs on any platform/environment.
Agentself-evolvingmulti-platform
190
OtherProject
Qwen-Agent
Agent development framework based on Qwen 3.0+, supporting Function Calling, MCP, Code Interpreter, RAG, Chrome extensions, etc. Serves as the backend for Alibaba's Qwen Chat.
Agent-frameworkQwenMCP
220
OtherProject
notebooklm-py
Unofficial Python API for Google NotebookLM, enabling programmatic access to all NotebookLM features including podcast generation, batch imports, quiz exports, and other capabilities not exposed in the Web UI.
NotebookLMPythonAPI
220
OtherProject
MiroFish
A versatile swarm intelligence engine that predicts everything through multi-agent simulation. Upload seed materials, describe prediction needs in natural language, and receive detailed reports with interactive high-fidelity digital worlds.
multi-agentpredictionsimulation
210
FrameworkProject
Nanochat - Karpathy's $100 ChatGPT
Andrej Karpathy's latest project: building the best ChatGPT alternative for just $100, featuring minimalist design and refined code.
ChatGPTKarpathyminimalist
210
ToolProject
Page Agent - Web Natural Language Control
An in-browser GUI Agent by Alibaba for controlling web interfaces with natural language, automating web operations without coding.
AI Agentbrowser automationAlibaba
233048
ToolProject
MiroFish - Swarm Intelligence Prediction Engine
A concise and versatile swarm intelligence engine for predicting anything. Python implementation based on Swarm Intelligence theory, supporting diverse prediction scenarios. Trending on GitHub with 4400+ stars today.
AIpredictionswarm intelligence
2713064
OtherProject
MiroFish
A concise and versatile swarm intelligence engine for predicting anything. Open-source Python project based on collective wisdom methodologies.
swarm intelligencepredictionPython
180
OtherProject
notebooklm-py
An unofficial Python API and Agent skills for Google NotebookLM, enabling full programmatic access to NotebookLM features via Python, CLI, and AI Agents.
NotebookLMGooglePython
180
OtherProject
WorldMonitor
A real-time global intelligence dashboard—AI-driven news aggregation, geopolitical monitoring, and infrastructure tracking in a unified situational awareness interface.
intelligence analysisnews aggregationgeopolitics
190
OtherProject
RuView
WiFi DensePose: Real-time human pose estimation, vital sign monitoring, and presence detection using ordinary WiFi signals—no cameras needed. Built with Rust for high performance and low latency.
WiFi sensingpose estimationprivacy-preserving
360
ToolProject
notebooklm-py
An unofficial Python API for Google NotebookLM, enabling programmable access to NotebookLM's podcast generation, summarization, and other features.
NotebookLMGoogleAPI
263969
FrameworkProject
Impeccable
A design language framework to make AI better at design, developed by Google engineers to help AI coding assistants generate more aesthetically pleasing UIs.
designAIUI
222494
AI & MLProject
MiroFish
A concise and versatile swarm intelligence engine for predicting anything. Python implementation based on swarm intelligence algorithms, trending on GitHub with 2200+ stars today.
swarm intelligencepredictionPython
278696
AI ToolsProject
claude-skills
A collection of 169 production-ready skills and plugins for Claude Code / OpenAI Codex / OpenClaw, covering engineering, marketing, product, compliance, and more.
210
AI ToolsProject
notebooklm-py
An unofficial Python API for Google NotebookLM, enabling programmatic notebook creation, source file uploads, and audio podcast summarization.
210
AI AppsProject
WorldMonitor
A real-time global intelligence dashboard—AI-driven news aggregation, geopolitical monitoring, and infrastructure tracking in a unified situational awareness interface. Top GitHub trending this week with 33k+ stars.
190
AI & MLProject
notebooklm-py
An unofficial Python API for Google NotebookLM, enabling programmable access to NotebookLM's summarization, Q&A, and podcast audio generation features, suitable for batch processing and automation workflows.
GoogleNotebookLMPython
283514
Dev ToolsProject
CyberStrikeAI
An AI-native security testing platform integrating 100+ security tools, intelligent orchestration engines, and role-based testing capabilities. Built with Go, supporting full security testing lifecycle management.
securityAIGo
222047
AI & MLProject
MiroFish
A concise and versatile swarm intelligence engine for predicting anything. Based on swarm intelligence algorithms, supports stock market, weather, sports, and other prediction scenarios. Python implementation, trending on GitHub with 399 stars today.
AIswarm intelligenceprediction
376303
OtherProject
Hermes Agent
A growing AI Agent framework by NousResearch, supporting autonomous learning and continuous evolution.
agentpythonnous-research
230
OtherProject
Scrapling
An adaptive web scraping framework supporting scenarios from single requests to large-scale crawling, with built-in anti-detection and intelligent parsing capabilities.
pythonweb-scrapingcrawler
210
FrameworkProject
Page Agent
A web-based GUI Agent by Alibaba for controlling web interfaces with natural language. No browser extensions, screenshots, or multimodal models required—pure JavaScript implementation, integrable into any web app with one line of code. Ideal for building AI Copilots and smart form-filling.
gui-agentweb-automationalibaba
32936
FrameworkProject
OpenAI Skills
A directory of Codex Agent Skills officially released by OpenAI, defining reusable instructions, scripts, and resource packages for AI Agents to automatically discover and execute tasks. Supports custom Skill creation and distribution.
openaicodexagent-skills
3512378
AI ToolProject
MiroFish
A swarm intelligence prediction engine based on multi-agent technology, creating high-fidelity parallel digital worlds where thousands of Agents with independent personalities and long-term memories interact freely, simulating social evolution to accurately predict future trends. Supports financial forecasting, public opinion analysis, and narrative simulation.
multi-agentpredictionsimulation
285185
OtherProject
Khoj
Your AI second brain, a self-hosted personal AI assistant. Supports retrieving answers from webpages or documents, building custom Agents, and deep research.
AI-assistantself-hostedRAG
210
OtherProject
Hello Agents
A Chinese tutorial by Datawhale for building AI Agents from scratch, guiding you from principles to practice step-by-step.
tutorialagentChinese
200
OtherProject
MobileAgent
A mobile GUI Agent series by Alibaba X-PLUG, supporting multimodal understanding of phone screens and automated operations, a benchmark project for GUI automation.
agentmobileGUI
210
OtherProject
Hermes Agent
An adaptive AI Agent framework by NousResearch, supporting tool usage, memory management, and continuous learning, touted as an Agent that grows with you.
agentNousResearchtool-use
180
OtherProject
HVE Core
An enterprise-grade Prompt engineering framework by Microsoft, powering GitHub Copilot with constraint-driven AI workflows. Built-in RPI (Research→Plan→Implement) methodology, specialized Agents, and JSON Schema validation, suitable for individuals to large teams.
MicrosoftCopilotPrompt Engineering
200
OtherProject
Webnovel Writer
A Claude Code-based web novel writing assistant using RAG + entity graphs to address AI's memory and hallucination issues, supporting 2-million-word serials with built-in reader engagement tracking and visual Dashboard.
AI writingClaude CodeWeb novels
230
OtherProject
SEO Machine
A Claude Code-based SEO long-form writing workstation with specialized commands like /research, /write, /optimize and 26 marketing Skills, integrating GA4 and Search Console data to mass-produce high-ranking blog content.
SEOClaude CodeContent creation
210
OtherProject
Khoj
A self-hosted AI second brain supporting web search, local document Q&A, custom Agents, scheduled automation, and deep research, compatible with mainstream LLMs. ⭐ 33k
ai-assistantself-hostedrag
240
OtherProject
AReaL
inclusionAI's (Ant Group) open-source LLM reinforcement learning framework focused on improving reasoning and Agent capabilities, built for speed and flexibility. ⭐ 4k
reinforcement-learningllmreasoning
390
OtherProject
hello-agents
Datawhale's 'Building Agents from Scratch' tutorial systematically explains Agent principles and practices, ideal for beginners. Chinese tutorial with code examples. ⭐ 25.4k
agenttutorialchinese
230
OtherProject
MobileAgent
Alibaba X-PLUG's mobile GUI Agent series supporting multimodal visual perception and autonomous smartphone operations, suitable for automated testing and mobile assistant scenarios. ⭐ 7.9k
agentmobilegui
210
SecurityProject
Trivy
An all-in-one security scanner for containers, K8s, code repositories, and cloud environments, detecting vulnerabilities, misconfigurations, and secret leaks. Essential for DevSecOps.
Security scanningContainer securityDevSecOps
6032778
AI & MLProject
AReaL
Ant Group's open-source LLM reasoning and Agent reinforcement learning framework for rapid training and enhanced model inference capabilities.
Reinforcement learningLLMReasoning
323930
WritingProject
SEO Machine
A Claude Code-based SEO long-form content workstation that automatically researches, writes, analyzes, and optimizes top-ranking blog content for any business scenario.
SEOClaude CodeContent generation
241271
ToolProject
Codex CLI
OpenAI's terminal programming Agent that writes and runs code via natural language in the command line
OpenAITerminalAgent
210
ToolProject
Claude Code
Anthropic's terminal-based AI programming tool for collaborative coding with Claude in the command line
TerminalAI programmingAnthropic
200
ToolProject
Windsurf
An AI-first code editor deeply integrated with AI for smart coding and project understanding
Code editorAI-firstSmart coding
180
ToolProject
Roo Code
A VS Code AI programming extension supporting multiple models and customizable workflows
VSCodeAI programmingMulti-model
200
ToolProject
MindSearch
An open-source AI search engine framework simulating human thinking for multi-step search and reasoning
AI searchMulti-step reasoningMind simulation
200
ToolProject
STORM
Stanford's open-source AI long-form writing system that automatically researches and generates Wikipedia-level articles
Long-form writingAutomated researchStanford
200
FrameworkProject
AG2 (AutoGen v2)
Microsoft's next-gen AutoGen version with more flexible multi-Agent conversations and orchestration
MicrosoftMulti-AgentConversation orchestration
180
FrameworkProject
Mastra
A TypeScript AI Agent framework supporting workflows, RAG, and multi-Agent orchestration
TypeScriptAgentWorkflow
200
FrameworkProject
MCP (Model Context Protocol)
Anthropic's open-source protocol standardizing connections between AI and external tools
ProtocolAnthropicTool integration
180
ToolProject
Stagehand
An AI-powered web automation SDK open-sourced by Browserbase, controlling browsers with natural language
Web automationNatural languageBrowser
200
ToolProject
Browser Use
Enables AI Agents to browse the web like humans, performing clicks, inputs, and navigation
Browser automationAgentWeb operations
180
FrameworkProject
Smolagents
A lightweight Agent library open-sourced by HuggingFace, featuring concise APIs for building tool-calling Agents
HuggingFaceLightweightAgent
200
FrameworkProject
Pydantic AI
A Pydantic-based Agent framework for type-safe AI application development
PydanticType safetyAgent
300
AgentProject
OpenHands
An AI-driven software development Agent capable of writing code, fixing bugs, and managing projects
Software developmentAgentBug fixing
240
FrameworkProject
LangServe
A one-click tool for deploying LangChain chains and Agents as REST APIs
DeploymentLangChainREST API
220
ToolProject
Unstructured
Extract clean text and metadata from unstructured documents such as PDF, Word, HTML and more
Document extractionunstructuredETL
210
ToolProject
Verba
The open-source RAG chatbot powered by Weaviate, supporting multiple embedding models and LLMs
RAGchatbotWeaviate
190
ToolProject
Khoj
Open source personal AI assistant that can search notes, documents and the Internet, and supports self-hosting
Personal AInote searchself-hosting
200
ToolProject
Perplexica
An open-source AI search engine and Perplexity alternative, supporting multiple search modes
AI searchPerplexity alternativeopen source
180
ToolProject
Danswer
Enterprise-level AI search and Q&A assistant that connects internal company documents and tools
Enterprise searchQ&Ainternal documentation
230
FrameworkProject
E2B
Cloud infrastructure that provides secure sandbox environments where AI Agents can execute AI-generated code
sandbox, secure execution, cloud infrastructure
200
FrameworkProject
Composio
A platform for integrating AI Agents with 200+ tools, connecting GitHub, Slack, databases, and more
Tool integrationAgent200+ tools
230
AgentProject
OpenDevin
An open source AI software engineering agent platform that can write and debug code like human developers
AI developerAgentcode writing
250
FrameworkProject
Gradio
A Python library for quickly building web demo interfaces for machine learning models, supporting multiple input and output types
Web demo, ML model, UI building
240
FrameworkProject
Streamlit
A Python framework for quickly building web apps for data and AI, deployable with just a few lines of code
Web frameworkPythondata application
230
FrameworkProject
Chainlit
Python framework to quickly build AI chat applications, supporting streaming, multi-modality and human-in-the-loop
Chat applicationPythonstreaming
600
ToolProject
Promptfoo
LLM evaluation and red team testing tools to systematically evaluate prompt quality and model security
Evaluationred team testingprompt optimization
230
FrameworkProject
Guardrails
Adds structured validation and safety guardrails to LLM output to reduce hallucinations and formatting errors
Safety guardrails, validation, anti-hallucination
180
FrameworkProject
Instructor
A structured data extraction library that lets LLMs reliably output JSON conforming to a Pydantic model
Structured outputPydanticJSON
190
AgentProject
Sweep
An AI-driven GitHub bot that automatically converts issues into PRs, handling bug fixes and small features
GitHub bot, automatic PRs, bug fixing
210
ToolProject
GPT4All
A desktop chat app for running open-source LLMs on any device, fully offline and private
Offlinedesktop applicationprivacy
180
ToolProject
LM Studio
A desktop application for discovering, downloading, and running local LLMs on your laptop
Desktop applicationlocal LLMeasy to use
180
ToolProject
Stirling PDF
Powerful self-hosted PDF tool that supports AI-assisted PDF editing and conversion
PDF toolsself-hostedediting and conversion
210
ToolProject
Immich
Self-hosted Google Photos alternative that supports AI face recognition and object search
Photo managementAI recognitionself-hosting
180
ToolProject
Screenshot to Code
Upload screenshots to automatically generate front-end code, supporting HTML/Tailwind/React/Vue, etc.
Screenshot to codefront-endautomatic generation
180
ToolProject
Manim
3Blue1Brown's Python engine for producing mathematical animations; can be used to generate teaching animations with AI assistance
Mathematics animationteachingPython
240
ModelProject
Style-TTS 2
Human-level text-to-speech model with naturalness scores close to real people
TTSnatural speechhuman level
200
ModelProject
Xtts
Coqui's open-source multilingual TTS model, supporting voice cloning from just 6 seconds of reference audio
TTSvoice cloningmulti-language
200
ModelProject
CogVideo
CogVideoX, Zhipu's open-source text-to-video model, supporting multiple resolutions and durations
Zhipu, text-to-video, multi-resolution
200
ModelProject
Open Sora
An open source project that reproduces Sora video generation, supporting video generation from text and images
Soravideo generationopen source
220
ToolProject
LobeHub TTS
Open source high-quality TTS toolkit, supporting multiple TTS engines and voices
TTSspeech synthesismulti-engine
200
ModelProject
Stable Diffusion 3
Stability AI's latest text-to-image model, based on the MMDiT architecture, with greatly improved quality
Stability AI, text-to-image, MMDiT
190
ModelProject
Llama 3
Meta's open-source third-generation Llama model, available in 8B and 70B versions with multilingual support
MetaLlamaopen source
250
ModelProject
Phi-3
Microsoft's open-source small language model; at 3.8B parameters, its performance reportedly surpasses models of the same size class
Microsoftsmall modelefficient
190
ModelProject
Gemma 2
Google's open-source lightweight LLM, available in 2B and 9B versions, suitable for local deployment
Googlelightweightlocal deployment
200
ModelProject
DeepSeek-V3
DeepSeek's open-source MoE large model, with 671B total parameters and 37B activated per token; performance comparable to GPT-4
DeepSeekMoEopen source
180
ModelProject
Mistral
Mistral AI's open-source series of efficient large language models, delivering strong performance at small parameter counts
Mistral, efficient, small models
250
ModelProject
Qwen2.5
Alibaba's open-source Tongyi Qianwen (Qwen) model, available in multiple sizes with leading Chinese and English capabilities
Alibaba, Tongyi Qianwen, Chinese and English
190
ModelProject
Yi
01.AI's open-source large language model series, available in multiple sizes with multimodal support
01.AI, large model, multimodal
180
ModelProject
GLM-4
Zhipu AI's open-source bilingual dialogue model, supporting 128K context and multimodal understanding
Zhipu, bilingual, 128K
190
ModelProject
Kling AI
Kuaishou's open source video generation model, supporting text-to-video and image-to-video
Video generation, Kuaishou, text-to-video
180
ModelProject
MuseV
Virtual human video generation framework based on diffusion model, supporting unlimited length videos
Virtual humanvideo generationdiffusion model
180
ToolProject
Pika
An AI video generation tool that converts text and images into high-quality short videos
Video generation, text-to-video, image-to-video
210
ModelProject
FLUX
Black Forest Labs' open-source text-to-image model, with quality comparable to Midjourney
Text-to-image, image generation, open source
190
ModelProject
Segment Anything 2
Meta's universal image and video segmentation model, zero-shot segmentation of any object
Metaimage segmentationvideo segmentation
180
FrameworkProject
Supervision
A library of computer vision tools that simplifies visualization and analysis of object detection, segmentation, and tracking
Computer vision, object detection, tracking
180
ToolProject
Surya
Multilingual OCR and document analysis tools, supporting 90+ languages with extremely high accuracy
OCRmulti-languagedocument analysis
220
ToolProject
Marker
Fast and accurate PDF to Markdown conversion tool that supports tables, formulas and code blocks
PDFMarkdownconversion
360
ToolProject
MinerU
A one-stop open-source data extraction tool that converts PDFs to Markdown and JSON, with OCR support
Data extractionPDFOCR
340
ToolProject
Docling
IBM's open-source document parsing tool that converts PDF, DOCX, and other formats to Markdown, with support for tables and formulas
Document parsingPDFMarkdown
310
ToolProject
Activepieces
Open source Zapier alternative that supports AI-driven automated workflow building
AutomationZapier alternativeworkflow
190
ToolProject
n8n
A workflow automation platform with AI nodes for integrating LLMs into automated processes
workflowautomationintegration
240
ToolProject
Bolt.new
An AI full-stack development tool from StackBlitz for building complete applications through conversation in the browser
Full stackbrowser developmentStackBlitz
230
ToolProject
Cline
An autonomous coding agent in VS Code that can create files, execute commands, and use a browser
VSCodeProgramming AgentAutonomous
220
AgentProject
Replit Agent
AI-driven full-stack application generator that generates complete deployable web applications through dialogue
Full stackapplication generationWeb
310
AgentProject
SWE-agent
Princeton's open-source automated bug-fixing agent that locates and fixes code issues on GitHub
Automatic bug fixingGitHubPrinceton
200
AgentProject
Devika
An open-source AI software engineer, similar to Devin, that can understand requirements and write code independently
AI engineer, Devin alternative, code generation
180
AgentProject
TaskWeaver
Microsoft's open-source code-first Agent framework, well suited to data analysis and automation tasks
MicrosoftAgentdata analysis
190
ToolProject
Quivr
Your second brain: uses RAG to turn documents, web pages, and notes into a conversational knowledge base
Second brainknowledge baseRAG
220
FrameworkProject
DB-GPT
An AI-native data application development framework for building multi-model, data-driven applications with AWEL and Agents
Data applicationdatabaseAgent
200
ToolProject
MaxKB
An enterprise-grade knowledge base Q&A system built on large models, ready to use out of the box, with support for multiple document formats
Knowledge baseenterprise levelQ&A
190
ToolProject
LibreChat
An enhanced ChatGPT clone supporting multiple AI service providers, plugins, and multimodal interactions
ChatGPT clonemultiple service providersplug-ins
190
ToolProject
Lobe Chat
An open source high-performance AI chat framework that supports multiple models, plug-in systems and knowledge bases
Chat frameworkmultiple modelsplug-ins
210
ToolProject
InvokeAI
Professional-grade AI image generation workstation, providing canvas editing, ControlNet and workflow management
Image generationcanvas editingprofessional level
200
ToolProject
Fooocus
A minimalist interface for Stable Diffusion that generates high-quality images without parameter tuning
Image generationStable Diffusionminimalist
180
ModelProject
Bark
Suno's open-source text-to-audio model, supporting speech, music, background sound effects, and non-verbal sounds
audio generationTTSmusic
230
ModelProject
Fish Speech
An open-source TTS model supporting multilingual zero-shot voice cloning with latency as low as 150ms
TTSvoice cloningmulti-language
190
ModelProject
ChatTTS
Text-to-speech model optimized for dialogue scenarios, supports Chinese and English, and has natural rhythm
TTSspeech synthesisdialogue
190
ModelProject
MiniCPM
ModelBest's open-source on-device large model series; its 2B-parameter model is claimed to match 7B models, and it supports deployment on mobile phones
On-device model, lightweight, mobile deployment
240
ToolProject
Crawl4AI
Asynchronous web crawler optimized for AI and LLM, automatically extracting structured data
Crawlerasynchronousstructured data
190
ToolProject
Firecrawl
Converts web pages into clean, LLM-ready Markdown, with support for crawling entire websites
Web crawlingMarkdowndata cleaning
190
AgentProject
ExUI (Open Interpreter)
Lets LLMs run code locally, controlling the computer through natural language and executing Python, Shell, and more
Code executionnatural languagelocal execution
190
ToolProject
llama.cpp
An LLM inference engine implemented in pure C/C++, lightweight and efficient, supporting CPUs and a variety of other hardware
C++, CPU inference, lightweight
200
ToolProject
TensorRT-LLM
NVIDIA's LLM inference optimization library for maximum inference performance on GPUs
NVIDIAinference optimizationGPU
180
FrameworkProject
Semantic Kernel
Microsoft's open-source AI orchestration SDK that integrates LLMs with conventional programming languages, with support for plugins and planners
MicrosoftSDKorchestration
190
FrameworkProject
LlamaIndex
A data framework for LLMs that connects private data to large models for building knowledge-augmented AI applications
Data connection, RAG, knowledge augmentation
190
ToolProject
Aider
An AI programming assistant for the terminal that edits code files directly through dialogue, with Git integration
TerminalAI programmingGit
200
ToolProject
Continue
An open-source AI code assistant IDE plugin for VS Code and JetBrains that can connect to any LLM
IDE plug-incode assistantVSCode
210
ToolProject
SillyTavern
AI role-playing chat frontend for advanced users, supporting multiple LLM backends and rich customization options
Role-playingchatfront-end
470
FrameworkProject
Mem0
An intelligent memory layer that adds persistent, personalized memory to AI applications, supporting user-level and session-level memory
memorypersonalizationcontext
210
FrameworkProject
Haystack
End-to-end NLP/LLM application construction framework, supporting RAG, Q&A, semantic search, etc.
NLPRAGsearch
220
ToolProject
OpenRouter
An AI model routing platform: one API for accessing hundreds of AI models, with automatic selection of the best model
Model routingAPImulti-model
200
ToolProject
LiteLLM
A unified LLM API proxy for calling 100+ LLMs in the OpenAI format, including Azure, Anthropic, Ollama, and more
API proxymulti-modelunified interface
200
FrameworkProject
Phidata
A framework for building AI assistants with memory, knowledge, and tools, supporting multimodal and multi-Agent setups
AI assistantmemoryknowledge base
220
AgentProject
AutoGen
Microsoft's open-source multi-Agent conversation framework, in which agents converse and collaborate automatically to solve problems
Microsoftmulti-Agentdialogue
200
AgentProject
CrewAI
A multi-agent collaboration framework that allows AI agents to work together like a team to complete complex tasks
Multi-agentcollaborationAgent
220
ToolProject
Tabby
A self-hosted AI code assistant and open-source alternative to GitHub Copilot, with support for local deployment
Code completion, Copilot alternative, self-hosting
190
AgentProject
GPT Engineer
Describe your requirements in one sentence and the AI generates a complete code project, with support for iterative improvement
Code generationautomationfull stack
200
ToolProject
Cursor
An AI-driven code editor based on VS Code, integrating GPT-4 and Claude for intelligent code completion and chat
Code editorAI programmingVSCode
210
ToolProject
AnythingLLM
An all-in-one AI document assistant supporting multiple LLMs and vector databases, with a complete RAG pipeline
Document AssistantRAGmulti-model
190
FrameworkProject
Langflow
Visual framework for RAG and multi-Agent AI applications, supporting drag-and-drop construction and Python customization
VisualizationRAGAgent
190
ToolProject
Flowise
A drag-and-drop tool for building LLM applications: visually orchestrate LangChain flows and build AI workflows with zero code
Low codevisualizationLangChain
220
ToolProject
Jan
An open-source desktop alternative to ChatGPT that runs 100% offline, supporting Llama, Mistral, and other models
Desktop application, offline, ChatGPT alternative
210
ToolProject
LocalAI
An open-source OpenAI API alternative that runs LLMs, image generation, and audio transcription locally, completely offline
Local deploymentAPIoffline
190
ToolProject
PrivateGPT
A 100% private document Q&A system that needs no Internet connection and runs LLMs locally for document interaction
PrivacyRAGlocal deployment
190
AgentProject
AutoGPT
An autonomous AI agent framework that lets GPT-4 complete complex tasks on its own, setting goals, decomposing them into steps, and executing them
AI AgentautonomyGPT-4
200
ToolProject
A 3D generation project of NVIDIA: Edify 3D, which can generate high-quality, production-ready 3D models in 2 minutes, and can also generate complex 3D scenes.
NVIDIA's Edify 3D project can generate high-quality 3D models and complex scenes in 2 minutes, providing fine geometric meshes, clean topology, standardized UV maps, 4K textures and a complete PBR material system. Compared with other text-to-3D tools, Edify 3D performs well in shape and texture quality, efficiency and scalability, and supports 3D generation from text or images.
3D generationmodel
190
ToolProject
Cool, an AI tool for quickly generating 3D content: DiffSplat, which can generate corresponding 3D content based on text or images in 1-2 seconds
DiffSplat is an AI tool for quickly generating 3D content. It can generate high-quality 3D content based on text or images within 1-2 seconds. It supports ControlNet for fine control and is very suitable for rapid iteration or real-time generated scenes.
3D generationAI
190
ToolProject
A high-fidelity single image to 3D generation model: TripoSG, 1.5 billion parameters, capable of processing complex image features
TripoSG is a high-fidelity single-image-to-3D generation model with 1.5 billion parameters. It combines the linear trajectory modeling of rectified flow with a Transformer architecture, which lets it stably model 3D object shapes and process photos, cartoons and sketches.
3D generationmodel
200
ToolProject
Tencent’s Hunyuan 3D generation model has just released the latest version: v2.5. The total parameter volume has been increased from 1B to 10B, and the number of effective patches has increased by more than 10 times.
Tencent has released v2.5 of its Hunyuan 3D generation model. The parameter count increases from 1B to 10B and the number of effective patches grows tenfold. It supports multi-view PBR textures, 4K high-definition textures and fine-grained bump maps, optimizes the skeletal skinning system, and improves the efficiency of 3D animation generation.
3D generationmodel
190
ToolProject
Tencent has recently open sourced a universal 3D reconstruction model: HunyuanWorld-Mirror, adding diversified inputs. Videos/multi-view images can also generate 3D worlds with one click.
Tencent has open sourced the HunyuanWorld-Mirror model, which supports the generation of high-quality 3D worlds from videos and multi-view images. It uses a feed-forward architecture to generate multiple three-dimensional representations at the same time, and is suitable for scenarios such as e-commerce and the digitization of museum cultural relics.
3D generationopen sourcemodel
200
ToolProject
Cool, Alibaba has released a 3D digital human model that can be animated from a single image in a few seconds: LHM, which supports real-time rendering and pose control.
Alibaba launched LHM, a 3D digital human model that can be reconstructed and animated from a single image in a few seconds. It supports real-time rendering and pose control, with realistic results and excellent handling of detail.
3D digital human generationmodel
190
ToolProject
A fast generator of ultra-large-scale 3D city models: GaussianCity, 60 times faster than the existing CityDreamer solution
GaussianCity is a tool for quickly generating ultra-large-scale 3D city models. It runs at 10.72 FPS, 60 times faster than existing solutions. It supports unbounded city generation, uses BEV-Point as an intermediate representation, and combines it with a space-aware decoder to achieve high-quality rendering.
3D city generative modelmodel
210
ToolProject
A 3D human body model generation project: StructLDM, which can generate a complete 3D human body and also supports partial editing and modification, such as changing clothes, changing postures, etc.
StructLDM is an open source project that can generate a complete 3D human body model, supports the generation of different poses and perspectives, as well as the editing and modification of clothing and body shapes, allowing functions such as partial try-on and identity exchange.
3D human body generationmodel
220
ToolProject
A system for reconstructing 3D human skeleton and mesh models from a single image: HSMR, which can convert a photo of an ordinary person into a 3D human body model with biomechanically accurate bone structure
The HSMR system is capable of reconstructing biomechanically accurate 3D human skeleton and mesh models from a single image, suitable for complex poses and poor shooting angles, and provides the means to create pseudo-realistic datasets to train other models.
3D human body model generationmodel
200
ToolProject
Tencent has open sourced its 3D world generation model: HunyuanWorld 1.0, which can generate an immersive, explorable, and interactive 3D world.
Judging by the results, the generated images are high quality and rich in detail, and the model has some physical simulation capability. It supports both text and image input, offers diverse styles, and the generated scenes can be explored in immersive 360° roaming. It supports mesh export and is seamlessly compatible with existing computer graphics pipelines
3D world generationopen sourcemodel
220
ToolProject
The generative multimodal world model from Fei-Fei Li's World Labs: Marble has just been released. A picture, video, text prompt, or 3D layout can generate a high-fidelity 3D world.
Marble is a generative multimodal world model released by Fei-Fei Li's World Labs. It generates high-fidelity 3D worlds from pictures, videos, text prompts and 3D layouts. Users can edit the results further, modify materials and styles, and export Gaussian splats and triangle meshes, which makes it suitable for game, film and television production.
3D world generationmodel
190
ToolProject
A 3D/4D scene generation project from Microsoft and the National University of Singapore: GenXD, which can handle both 3D and 4D scene generation.
GenXD is a 3D/4D scene generation project jointly developed by Microsoft and the National University of Singapore. It can handle 3D and 4D scene generation at the same time. The generated video follows the camera trajectory, keeps the object shape consistent, supports 3D format conversion, and is suitable for virtual reality, games and architectural visualization. The model supports the processing of real-world and synthetic datasets, providing single/multi-view 3D and 4D generation.
3D/4D generative models
270
ToolProject
A framework for generating controllable 3D and 4D scenes from a single image: DimensionX, which can precisely control the generation process and generate a 360-degree omnidirectional view
DimensionX is a framework that can generate controllable 3D and 4D scenes from a single image, supporting 360-degree omnidirectional viewing angles and multiple camera control modes. Its ST-Director component gives precise control over the spatial and temporal dimensions of video generation, producing coherent multi-view video sequences that maintain scene consistency while supporting spatiotemporal changes.
3D/4D generative models
200
ToolProject
A Google project to generate 4D scenes (dynamic 3D) from monocular video: CAT4D
Google's CAT4D project can generate dynamic 3D scenes from monocular video, supports separate control of camera perspective and time, and provides three output modes. This project outperforms other models in reconstruction effects, is suitable for real or generated videos, and is mainly used for movie special effects. Currently only supports Chrome 130+ browsers.
3D/4D generative models
210
ToolProject
Very cool. Tencent has just open sourced its latest 3D generation model: version 2.0 of Hunyuan 3D. The generated geometric structure is more detailed, the texture color is richer, and the performance exceeds existing open source and commercial models.
Tencent has open sourced version 2.0 of Hunyuan3D, which has a more detailed geometric structure and rich texture colors, supports high-resolution 3D asset generation, includes geometry generation and texture synthesis models, and provides a convenient production platform Hunyuan3D-Studio.
3D/4D generative modelsopen sourcemodels
200
ToolProject
An AI web browsing automation tool: Stagehand, which can help you perform operations such as clicking, typing, searching, and natural language interaction on web pages.
Stagehand is an AI web browsing automation tool that supports clicks, inputs, searches and other operations, and can interact through natural language. It has visual recognition capabilities, can analyze the structure of web pages, extract structured data, conduct automated testing and record operation logs, and is lightweight and configurable.
open sourceAI
190
ToolProject
A "Visio" plug-in for Claude Code: cc-wf-studio, drag and drop to create an automated workflow for Claude Code with zero code
cc-wf-studio is a code-free plugin designed for Claude Code that supports complex workflow creation, including branching, looping, and user interaction. Users can design workflows through the graphical interface and export them to .claude format files. The built-in AI optimization function can automatically adjust nodes and lines.
No code platform
230
ToolProject
Given a video link, this AI tool automatically generates a hand-drawn storyboard: clipsketch-ai, which also writes copy tailored to social media platforms so you can post with one click.
clipsketch-ai is an AI tool that automatically generates hand-drawn storyboards from video links and writes copy tailored to social media platforms. It supports batch generation and optimization of storyboards, and users can upload custom characters to integrate into the scene. It can parse video links from Bilibili and Xiaohongshu.
Self-media toolsAI
370
ToolProject
Alibaba's digital human framework: HRM2Avatar, which produces high-fidelity digital humans at 90–120 FPS and 2K resolution on mobile devices, captured with a single phone camera
The HRM2Avatar framework generates high-fidelity digital humans at 120 FPS on iPhone 15 Pro Max and 90 FPS on Apple Vision Pro, at 2K resolution, and can reproduce the character’s geometry, facial features, clothing folds, and light and shadow changes in real time
digital human
180
ToolProject
An AI programming agent task Kanban tool: Vibe Kanban, helping you to increase AI programming efficiency by 10 times
Vibe Kanban is an AI programming agent task Kanban tool that can significantly improve programming efficiency. It supports task management of multiple Agents, monitors status and results in real time, automatically creates git branches and submits changes, making it convenient for developers to conduct code reviews and Pull Requests. Users can track task execution through centrally managed MCP configuration and detailed logs
AI programmingAI
200
ToolProject
A scientific skills library for Claude: claude-scientific-skills, 138 plug-and-play scientific skills across 20+ fields
Provides 138 plug-and-play scientific skills covering more than 20 fields such as bioinformatics, drug discovery, materials science and clinical research, helping researchers accelerate research, process data and conduct scientific education
claude skills
190
ModelProject
Ant Health and others have just open sourced a medical language model: AntAngelMed
AntAngelMed, Ant Health's open source medical language model, is based on the MoE architecture of Ling-flash-2.0. It has 100B total parameters and 6.1B active parameters, an inference speed above 200 tokens/s, and a 128K context window. It ranks first on the HealthBench benchmark. After three-stage training, it has strong medical knowledge and complex diagnosis and treatment reasoning capabilities, making it suitable for complex case analysis and everyday health Q&A.
Medical large modelopen sourcemodel
250
ModelProject
Shanghai Artificial Intelligence Laboratory has open sourced [InternVLA-A1], a vision-language-action end-to-end unified model
The Shanghai Artificial Intelligence Laboratory has open sourced InternVLA-A1, a unified vision-language-action model that can perform tasks under natural language prompts, such as "put the pen on the table into the pen holder" without preset coordinates. It solves the problem of information loss between traditional perception and action, and supports multi-modal perception and understanding.
VLAopen sourcemodel
200
ToolProject
A financial application that integrates global data, CFA-level algorithms and AI strategies: FinceptTerminal, which can be used to assist stock selection, valuation, portfolio making, etc.
FinceptTerminal is a financial application that combines global data, CFA-level algorithms and AI strategies to support stock selection, valuation and portfolio management. It can connect to a variety of data sources and has a variety of built-in investment strategies. Users can drag and drop ready-made modules for analysis, simplifying the investment decision-making process.
AI financeAI
270
ToolProject
NVIDIA's latest open source ASR: Nemotron Speech ASR, specializes in low-latency real-time voice agent scenarios, allowing multiple people to talk at the same time without causing delays
Nemotron Speech ASR launched by NVIDIA focuses on low-latency real-time speech processing and supports multi-person conversations. The transcription time of a single sentence is only 24 milliseconds, and the overall latency is less than 500 milliseconds. Through the FastConformer architecture and cache-aware mechanism, voice features are incrementally calculated and delay modes are dynamically configured to meet different application needs.
ASRopen source
260
ModelProject
DiT-based audio-video foundation model: LTX-2 is now open source, 19B, and directly outputs video with picture, narration and ambient sound
LTX-2 is an open source audio-video generation model based on DiT that directly outputs picture, narration and ambient sound effects together. It is about 18 times faster than Wan 2.2-14B on an NVIDIA H100. It is suitable for quickly generating short videos and advertisements, but it can get confused in multi-speaker dialogue.
Video generation modelopen sourcemodel
210
ModelProject
Alibaba Maps has also developed a world model: Fantasy World, which currently ranks first in several key indicators of the WorldScore world model rankings.
The world model FantasyWorld launched by Alibaba Maps ranks first in the WorldScore rankings. It aims to solve the shortcomings of existing video models in 3D understanding by unifying video generation and 3D prediction, and improve spatial consistency and 3D reasoning capabilities. This model supports AR/VR content generation and robot navigation, and is suitable for a variety of 3D task scenarios such as navigation, rapid decoration, and e-commerce display.
world modelmodel
200
ModelProject
NVIDIA released Cosmos Reason 2, its latest reasoning vision-language model, specializing in physical AI, with a context length of 256K
NVIDIA has launched Cosmos Reason 2, its latest reasoning vision-language model focused on physical AI, supporting context lengths up to 256K. The model improves spatiotemporal understanding and timestamp accuracy, handles 2D/3D point localization, bounding box coordinates, trajectory data and OCR, and outputs robot actions and motion trajectories. It is suitable for applications such as video analysis, data annotation and safety detection. Available in 2B and 8B sizes
VLMAImodel
260
ModelProject
Alibaba has just released its latest models: Qwen3-VL-Embedding and Qwen3-VL-Reranker, the core of which is multi-modal information search.
Alibaba launched the Qwen3-VL-Embedding and Qwen3-VL-Reranker models, which support multi-modal information search, including text-to-image and image-to-video search. Used together, they form a two-stage retrieval process (embedding recall followed by reranking) that improves search quality, and they support more than 30 languages. Embedding-8B performs well on MMEB-V2
large modelmodel
370
ToolProject
A popular AI personalized learning tool from HKUDS at the University of Hong Kong: DeepTutor. It addresses three problems: plenty of information but no answers, no one to explain complex knowledge, and no one to set practice questions.
DeepTutor is an AI personalized learning tool that can solve the problems of difficulty in finding information, insufficient understanding of complex knowledge, and lack of practice questions. It uses interactive visual learning to break down complex concepts into intuitive illustrations, and generates practice questions or simulation papers based on the user's knowledge level and learning goals, greatly improving learning efficiency.
AI learningAI
230
ToolProject
A Claude Code plug-in: CallMe, which proactively phones you when Claude completes a task, gets stuck, or needs a decision.
CallMe is a Claude Code plug-in that proactively calls you when Claude completes a task or needs a decision, so you do not have to watch the screen. It supports multi-turn voice conversations, and Claude can search, run code, and read and write files during a call. It works with smartphones, smart watches, and landlines.
AI phone assistant
210
ToolProject
Superpowers: a complete development workflow for Claude Code that makes the AI "think clearly" by following a process before writing code, reducing the chance of drift, rework, and unfinished code.
Superpowers is a complete development workflow designed to help AI think systematically before writing code and reduce the probability of code errors and rework. Each step is split into small verifiable tasks that the AI can complete and cross-check autonomously, so humans only need to make decisions at key points, which improves the AI's programming capability.
claude skillsAI
180
ToolProject
Very popular recently🔥, an "AI-JSON-UI" project open sourced by Vercel Labs: json-render, which solves the problem of inconsistent, hard-to-control AI-generated UI
Vercel Labs' open-source json-render project solves the problem of non-uniform AI-generated UI by defining the Schema of components and attributes, and achieves fast rendering and auditable generative UI. Users can generate complete React source code with one click, which is suitable for scenarios such as data dashboards and dynamic forms.
Generative UIAIopen source
190
ToolProject
An AI power tool for long-video meeting minutes, interview editing, and course review: whisperVideo
WhisperVideo is an AI tool that converts speech in long videos into text, automatically identifies the speaker, matches the content with the faces on screen, and generates subtitles and visualization panels with speaker IDs. It handles hours-long footage, automatically splits it into scenes and paragraphs, and is suitable for interviews, film editing and meeting minutes review. It uses WhisperX for transcription, Pyannote for speaker diarization, and SAM3 for face detection, then integrates all the information to generate subtitles and panel views.
video transcriptionAI
210
ToolProject
The latest 1B end-to-end OCR: LightOnOCR-2-1B, which reaches 5.7 pages per second on a single H100, at a cost of less than US$0.01 per thousand pages
The latest LightOnOCR-2-1B model performs well on OlmOCR-Bench and is 1.7 times faster than OlmOCR. It can handle complex documents and multi-language content. A single H100 processes 5.7 pages per second, and the cost per thousand pages is less than US$0.01.
OCR
190
ToolProject
A tool for real-time multi-AI collaboration: Claude Code Bridge, which allows Claude, Codex, Gemini, and OpenCode to work side by side on the same interface, making the entire process visible, controllable, and sustainable.
Claude Code Bridge is a real-time multi-AI collaboration tool that supports Claude, Codex, Gemini and OpenCode to work in parallel on the same interface. Each model can locally persist the conversation history and support one-click recovery. This tool uses a pure CLI workflow and automatic layout, suitable for tasks such as code review, architecture design and troubleshooting.
CLIAIopen source
240
ToolProject
An AI short drama automatic generation tool that has become very popular in the past two days: Huobao Drama, which automates the entire process of scripts, characters, storyboards, and video synthesis.
Huobao Drama is a popular tool for automatically generating AI short dramas, automating the entire process from creative concept to final video output. It makes short drama production efficient: one person with one computer can finish in an hour. It is suitable for quickly validating concepts and producing customized short dramas, and is especially strong at riding trending topics.
video generationAI
240
ToolProject
Slip, a clawdbot-skills library covering 30+ common scenarios: awesome-clawdbot-skills
The clawdbot-skills library provides skills for more than 30 common scenarios, covering front-end development, DevOps, automation, financial management, health, smart home and other fields. Users can find ready-made solutions here
skills
250
FrameworkProject
A hard-core all-in-one Claude Code configuration: 10 months of hands-on experience distilled into a plug-in that takes Claude Code straight to the "professional" level
This is a systematic set of Claude Code configuration plug-ins, distilled from 10 months of hands-on experience and designed to improve programming efficiency. The configuration covers agents, skills, hooks and more, and supports running multiple instances in parallel for higher output. Users can customize it as needed to avoid the memory problems caused by loading too many tools, and it offers suggestions for balancing them dynamically.
Learning and Tutorials
210
ToolProject
Hey, DeepSeek has just released DeepSeek-OCR-2, its latest OCR model, which focuses on a more human-like visual encoding method.
DeepSeek-OCR-2 is the latest OCR model from DeepSeek. It adopts a more human-like visual encoding method, processes the entire page as a "visual causal flow", and converts complex layouts, handwriting, formulas and tables into Markdown. It scored 91.09% on OmniDocBench v1.5, 3.73% higher than the previous generation. The model supports dynamic resolution and adapts to document images of different sizes and layouts, which makes it more practical.
OCR
190
ToolProject
A super popular GitHub project, a lightweight rendering tool that turns Mermaid text diagrams into good-looking charts in seconds: beautiful-mermaid
beautiful-mermaid is a lightweight rendering tool that quickly converts Mermaid text diagrams into good-looking charts. It outputs SVG and ASCII/Unicode box diagrams that are both clear and attractive. It has 15 built-in themes and zero DOM dependency, and is suitable for documentation, CLI and AI use. VS Code themes can be loaded and applied with one click.
AI chart generation
200
ToolProject
Zhipu just released another OCR model: GLM-OCR, PDF 1.86 pages per second, 1 yuan can run 2000 A4 scanned images, 0.9B, can be deployed at the edge
GLM-OCR is an efficient OCR tool that processes PDF documents at a speed of 1.86 pages per second. It can process approximately 2,000 A4 scanned images for 1 yuan, and the cost is only one-tenth of traditional OCR. It supports handwriting, complex tables and multi-language mixing, is suitable for high concurrency and edge deployment, and scored a high score of 94.6 on OmniDocBench V1.5. Can be deployed through vLLM, SGLang and Ollama to reduce inference latency and computing power overhead
OCR
210
FrameworkProject
A tutorial to teach you how to create an AI programming agent step by step: learn-claude-code, built from scratch
Provides a tutorial for building an AI programming agent from scratch, covering the cyclic use of models and tools, gradually increasing the number and functions of tools, and learning paths suitable for different needs. The number of lines of code ranges from 50 to 550, suitable for teaching and practical applications.
Learning and TutorialsAI
230
ModelProject
Alibaba has just open sourced its latest programming model, an 80B MoE with 3B active parameters: Qwen3-Coder-Next
Alibaba has open sourced Qwen3-Coder-Next, its latest 80B MoE programming model with 3B active parameters. It is good at long-horizon, multi-tool programming tasks, with a solve rate of over 70%. The model can understand requirements, write code, interact with the environment and complete tasks, and supports local IDE plug-ins, CLI agents and enterprise intranet deployment
LLMopen sourcemodel
350
ToolProject
Mistral has just released its latest speech-to-text model: Voxtral Transcribe 2, with real-time latency below 200ms and support for speaker separation.
Mistral has launched the Voxtral Transcribe 2 speech-to-text model with real-time latency of less than 200ms and supports speaker separation. The model is available in two versions: Voxtral Realtime is suitable for real-time applications, with delays configurable to within 200ms and word error rates close to the offline version; Voxtral Mini Transcribe 2 is suitable for batch processing and supports 13 languages and word-level timestamps
STTmodel
210
ToolProject
Claude Code has a new [Intelligent Review Optimization] feature, /insights, which goes over your conversations from the past 30 days and suggests ways to optimize your workflow.
Claude Code's new "/insights" feature reviews conversations from the past 30 days and suggests workflow optimizations, such as ways to save time and improve processes, helping users program more efficiently. This feature upgrades the AI from an assistive tool to a personalized efficiency engine
Update
200
ToolProject
A more practical code review skill package: code-review-expert, which allows AI to conduct a comprehensive and structured review of code like a senior engineer.
code-review-expert is an open source code review skills package that simulates the review perspective of a senior engineer, covering multiple key dimensions such as architecture, security, and performance. It provides SOLID principle checking, security scanning, performance optimization, error handling and code cleaning functions, and has a seven-step review process and P0-P3 grading system.
skillsAI
200
OtherProject
MobileAgent
The mobile GUI Agent series from Alibaba X-PLUG lets AI autonomously operate the mobile phone interface to complete complex tasks. Can be driven by multiple large models, 7.9k+ stars.
GUI Agentmobile terminalautomation
200
OtherProject
CC-Switch
Cross-platform desktop All-in-One assistant, unified management of multiple AI programming tools such as Claude Code, Codex, OpenCode, Gemini CLI, etc. 23k+ stars, one interface to switch all programming agents.
Desktop toolsAI programmingmulti-Agent
210
OtherProject
PageIndex
The vector-free RAG document indexing solution launched by VectifyAI implements document question and answer through reasoning rather than vector retrieval, with 20k+ stars. Subverting the traditional RAG process, document content can be accurately located without embedding.
RAGdocument indexAI search
210
OtherProject
Claude Code
The terminal AI programming assistant officially produced by Anthropic understands your codebase, performs routine tasks, explains complex code, and handles git workflows through natural language commands. A phenomenal tool with 73k+ stars.
AI programmingterminal toolscode assistant
210
ToolProject
vLLM
High-throughput LLM inference and serving engine built on PagedAttention, with up to 24x higher throughput than HuggingFace Transformers
Model inferencehigh performanceservice deployment
22760
FrameworkProject
RAGFlow
RAG engine based on deep document understanding, supporting accurate parsing of complex-format documents and high-quality Q&A
RAGdocument analysisknowledge base
24480
ModelProject
Whisper
OpenAI's open source general-purpose speech recognition model, supporting multilingual speech-to-text and translation with very high accuracy.
Speech recognitionASRmulti-language
291100
AgentProject
MetaGPT
A multi-agent framework that simulates the collaboration process of a software company and automatically generates the PRD, design, code and tests from a one-sentence requirement
multi-agentsoftware developmentautomation
22650
ToolProject
ChatGPT-Next-Web
Deploy your own ChatGPT web application with one click, supporting GPT-4, Gemini Pro and other models
ChatGPTone-click deploymentmultiple models
22720
ToolProject
ComfyUI
Stable Diffusion interface based on node workflow, flexible and powerful image generation pipeline orchestration tool
AI paintingworkflowStable Diffusion
24890
FrameworkProject
FastGPT
LLM-based knowledge base Q&A system, ready to use out of the box, supports data import, automatic segmentation, vectorization, and workflow orchestration
Knowledge baseRAGquestion and answer system
23560
ToolProject
Stable Diffusion WebUI
Stable Diffusion's browser interface supports all functions of txt2img, img2img, ControlNet, etc.
AI paintingStable Diffusionimage generation
411350
ToolProject
Ollama
Run Llama 3, Mistral, Gemma and other large language models locally with a simple, easy-to-use command-line tool
Local deploymentLlamamodel inference
281200
FrameworkProject
Dify
Open source LLM application development platform, providing visual prompt orchestration, RAG pipeline, Agent framework and model management
LLM platformRAGlow code
25870
ToolProject
Open WebUI
Feature-rich self-hosted AI chat interface, supports Ollama and OpenAI compatible API, deploy your own ChatGPT with one click
ChatGPTself-deploymentOllama
26980
FrameworkProject
LangChain
A powerful framework for building LLM applications, providing core capabilities such as chains, Agents and RAG; a preferred tool for AI application development
LLMRAGAgent
331520