2024-05-26 新增开源LLM

llm

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, …) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.

LlamaIndex is a data framework for your LLM applications

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Dify is an open-source LLM app development platform. Dify’s intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain

A cloud-native vector database, storage for next generation AI applications

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT4.0/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。

🐚 OpenDevin: Code Less, Make More

Drag & drop UI to build your customized LLM flow

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Unify Efficient Fine-Tuning of 100+ LLMs

The platform for customizing AI from enterprise data

🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.

A high-throughput and memory-efficient inference and serving engine for LLMs

互联网公司技术架构,微信/淘宝/微博/腾讯/阿里/美团点评/百度/OpenAI/Google/Facebook/Amazon/eBay的架构,欢迎PR补充

Integrate cutting-edge LLM technology quickly and easily into your apps

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Lord of Large Language Models Web User Interface

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Enable everyone to develop, optimize and deploy AI models natively on everyone’s devices.

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it’s best suited for building RAG, question answering, semantic search or conversational agent chatbots.

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

The simplest way to run LLaMA on your local machine

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

The open-source hub to build & deploy GPT/LLM Agents ⚡️

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

⏩ Open-source VS Code and JetBrains extensions that enable you to easily create your own modular AI software development system

  • QwenLM/Qwen : ✩11612 ✎Python ⏱2024-05-22 00:00:00

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

Low-code framework for building custom LLMs, neural networks, and other AI models

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/

High-performance In-browser LLM Inference Engine

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

Gorilla: An API store for LLMs

📋 A list of open LLMs available for commercial use.

Create LLM agents with long-term memory and custom tools 📚🦙

Machine Learning Engineering Open Book

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

AI driven development in your terminal. Designed for large, real-world tasks.

Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)

Python scraper based on AI

The official GitHub page for the survey paper “A Survey of Large Language Models”.

Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.

Official inference library for Mistral models

Build AI Assistants with memory, knowledge and tools.

Personalizing LLM Responses

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

The user analytics platform for LLMs

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Your friendliest open source all-in-one automation tool ✨ Workflow automation tool 100+ integration / Enterprise automation tool / Zapier Alternative

  • vercel/ai : ✩8075 ✎TypeScript ⏱2024-05-25 00:00:00

Build AI-powered applications with React, Svelte, Vue, and Solid

TypeChat is a library that makes it easy to build natural language interfaces using types.

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

Your AI second brain. A copilot to get answers to your questions, whether they be from your own notes or from the internet. Use powerful, online (e.g gpt4) or private, local (e.g llama3) LLMs. Self-host locally or use our cloud instance. Access from Obsidian, Emacs, Desktop app, Web or Whatsapp.

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.

Bisheng is an open LLM devops platform for next generation AI applications.

  • neuml/txtai : ✩7159 ✎Python ⏱2024-05-25 00:00:00

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Building a quick conversation-based search demo with Lepton AI.

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

😎 Awesome list of tools and projects with the awesome LangChain framework

本项目旨在分享大模型相关技术原理以及实战经验。

The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

A GPT-empowered penetration testing tool

Open source codebase powering the HuggingChat app

LLM Frontend for Power Users.

  • e2b-dev/E2B : ✩6216 ✎TypeScript ⏱2024-05-24 00:00:00

Secure cloud runtime for AI apps & AI agents. Fully open-source.

A framework for building custom AI Copilots 🤖 in-app AI chatbots, in-app AI Agents, & AI-powered Textareas.

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max). A PyTorch LLM library that seamlessly integrates with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, etc.

An ecosystem of Rust libraries for working with large language models

Build ChatGPT over your data, all with natural language

Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI

Build Conversational AI in minutes ⚡️

The GPU-powered AI application database. Get your app to market faster using the simplicity of SQL and the latest NLP, ML + LLM models.

The paper list of the 86-page paper “The Rise and Potential of Large Language Model Based Agents: A Survey” by Zhiheng Xi et al.

The unofficial python package that returns response of Google Bard through cookie value.

Retrieval and Retrieval-augmented LLMs

Official release of InternLM2 7B and 20B base and chat models. 200K context support

LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.

The open source platform for AI-native application development.

A UI-Focused Agent for Windows OS Interaction.

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Automate browser-based workflows with LLMs and Computer Vision

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

A curated list of modern Generative Artificial Intelligence projects and services

🤖 🔥 Language-to-actions engine

Firefly: 大模型训练工具,支持训练Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

✨ Build AI interfaces that spark joy

A curated list of GPT agents for cybersecurity

A code-first agent framework for seamlessly planning and executing data analytics tasks.

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl, search and extract with a single API.

A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.

《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程

An Open-source Framework for Autonomous Language Agents

🥷 Run AI-agents with an API

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Large Action Model framework to develop AI Web Agents

Prompt Engineering, Generative AI, and LLM Guide by Learn Prompting | Join our discord for the largest Prompt Engineering learning community

A simple one-file way to run various GGML and GGUF models with KoboldAI’s UI

Unified framework for building enterprise RAG pipelines with small, specialized models

Robust recipes to align language models with human and AI preferences

A tool for evaluating LLMs

🪢 Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

LLM training code for Databricks foundation models

Tools for merging pretrained large language models.

Instant answers to any programming question

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

State of the Art Natural Language Processing

ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io .

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/

OpenAgents: An Open Platform for Language Agents in the Wild

Power CLI and Workflow manager for LLMs (core package)

Collection of AI-related utilities. Welcome to submit issues and pull requests /收藏AI相关的实用工具,欢迎提交issues 或者pull requests

Curated tutorials and resources for Large Language Models, AI Painting, and more.

Adding guardrails to large language models.

33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU

🐢 Open-Source Evaluation & Testing for LLMs and ML models

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

General technology for enabling AI capabilities w/ LLMs and MLLMs

雅意大模型:为客户打造安全可靠的专属大模型,基于大规模中英文多领域指令数据训练的 LlaMA 2 & BLOOM 系列模型,由中科闻歌算法团队研发。(Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM)

Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)

One-stop system for shared use of AI within teams and organizationswith | 在团队和组织内共享使用人工智能的一站式系统

又名 <零生万物> , 是一本专属于软件开发工程师的 AI 入门圣经,手把手带你上手写 AI。从神经网络到大模型,从高层设计到微观原理,从工程实现到算法,学完后,你会发现 AI 也并不是想象中那么高不可攀、无法战胜,Just beat it !

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.

基于LangChain和ChatGLM-6B等系列LLM的针对本地知识库的自动问答

Training and serving large-scale neural networks with auto parallelization.

🚀 KIMI AI 长文本大模型逆向API白嫖测试【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像OCR、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Seamlessly integrate LLMs into scikit-learn.

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you’re empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

The no-code platform for building custom LLM Agents

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.一键免费部署你的私人AutoGPT 网页应用

An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM2, Llama3, Phi3, Qwen, Mistral, …)

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Effortless data labeling with AI support from Segment Anything and other awesome models.

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Database system for AI-powered apps

This codebase is for a React and Electron-based app that executes the FreedomGPT LLM locally (offline and private) on Mac and Windows using a chat-based interface

Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

Jupyter Notebooks to help you get hands-on with Pinecone vector databases

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

The open-source visual AI programming environment and TypeScript library

  • YiVal/YiVal : ✩2462 ✎Python ⏱2024-04-22 00:00:00

Your Automatic Prompt Engineering Assistant for GenAI Applications

Automatic Generation of Visualizations and Infographics using Large Language Models

Start building LLM-empowered multi-agent applications in an easier way.

Code examples and resources for DBRX, a large language model developed by Databricks

Turn expensive prompts into cheap fine-tuned models

An open-source ChatGPT app with a voice

⚡️Open-source AI LangChain-like RAG (Retrieval-Augmented Generation) knowledge database with web UI and Enterprise SSO⚡️, supports OpenAI, Azure, LLaMA, Google Gemini, HuggingFace, Claude, Grok, etc., chat bot demo: https://demo.casibase.com , admin UI demo: https://demo-admin.casibase.com

Run Mixtral-8x7B models in Colab or consumer desktops

开源社区第一个能下载、能运行的中文 LLaMA2 模型!

TigerBot: A multi-language multi-task LLM

💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram

总结Prompt&LLM论文,开源数据&模型,AIGC应用

Thinking notebook and Markdown editor with LLM wingman.

Interact with your SQL database, Natural Language to SQL using LLMs

🤖Self-Modifying Framework from the Future 🔮 World’s First AMS

A blazing fast inference solution for text embeddings models

Data processing with ML and LLM

A curated list of Generative AI tools, works, models, and references

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

A quick guide (especially) for trending instruction finetuning datasets

A colab gradio web UI for running Large Language Models

  • getzep/zep : ✩2050 ✎Go ⏱2024-04-27 00:00:00

Zep: Long-Term Memory for ‍AI Assistants.

Simple UI for LLM Model Finetuning

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Production ready AI assistant framework

The AI Agent Framework in .NET

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

eBPF Developer Tutorial: Learning eBPF Step by Step with Examples

Developer AI Persona Search Agent

Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use llama2-wrapper as your local llama2 backend for Generative Agents/Apps.

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

🔮 Instill Core is an open-source no-/low-code data, model and pipeline orchestration platform, providing a full-stack solution for AI-first applications

Magicoder: Source Code Is All You Need

Official Implementation of “Graph of Thoughts: Solving Elaborate Problems with Large Language Models”

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

收集和梳理垂直领域的开源模型、数据集及评测基准。

Label, clean and enrich text datasets with LLMs.

🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboration, troubleshooting, observability and more.

Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting

Resource list for generating JSON using LLMs via function calling, tools, CFG. Libraries, Models, Notebooks, etc.

AICI: Prompts as (Wasm) Programs

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

OpenAGI: When LLM Meets Domain Experts

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

A program that provides LLMs with the ability to complete complex tasks using plugins.

Harness LLMs with Multi-Agent Programming

Python bindings for the Transformer models implemented in C/C++ using GGML library.

Awesome papers about unifying LLMs and KGs

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Evaluation and Tracking for LLM Experiments

Seamlessly integrate LLMs as Python functions

LLM Finetuning with peft

  • Nixtla/nixtla : ✩1676 ✎Jupyter Notebook ⏱2024-05-23 00:00:00

TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It’s capable of accurately predicting various domains such as retail, electricity, finance, and IoT with just a few lines of code 🚀.

Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.

LangServe 🦜️🏓

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

  • cztomsik/ava : ✩339 ✎TypeScript ⏱2024-05-08 00:00:00

All-in-one desktop app for running LLMs locally.

Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM). Powers 👋 Jan

Blazingly fast LLM inference.

  • ysymyth/ReAct : ✩1641 ✎Jupyter Notebook ⏱2024-02-06 00:00:00

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

永久免费开源的 AIGC 课程, 目前已支持 ChatGPT, Midjourney, Runway, Stable Diffusion, AI数字人,AI声音&音乐,大模型微调

ms-swift: Use PEFT or Full-parameter to finetune 200+ LLMs or 15+ MLLMs

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

⚡ Langchain apps in production using Jina & FastAPI

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!

Turn any webpage into structured data using LLMs

OpenDAN is an open source Personal AI OS , which consolidates various AI modules in one place for your personal use.

LLM(😽)

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

Project Page for “LISA: Reasoning Segmentation via Large Language Model”

Distributed ML Training and Fine-Tuning on Kubernetes

Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.

Automated Machine Learning on Kubernetes

LlamaIndex is a data framework for your LLM applications

AIdea 是一款支持 GPT 以及国产大语言模型通义千问、文心一言等,支持 Stable Diffusion 文生图、图生图、 SDXL1.0、超分辨率、图片上色的全能型 APP。

Llama 2 Everywhere (L2E)

基于大模型的智能对话客服工具,支持微信、千牛、哔哩哔哩、抖音企业号、抖音、抖店、微博聊天、小红书专业号运营、小红书、知乎等平台接入,可选择 GPT3.5/GPT4.0/ 懒人百宝箱 (后续会支持更多平台),能处理文本、语音和图片,通过插件访问操作系统和互联网等外部资源,支持基于自有知识库定制企业 AI 应用。

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.

Open-source observability for your LLM application, based on OpenTelemetry

Documentation for Google’s Gen AI site - including the Gemini API and Gemma

Your personal code reviewer powered by LLMs (OpenAI GPT-3.5/4) & Embeddings ⚡️ Improve code quality and catch bugs before you break production 🚀 Lives in your Github/GitLab/Azure DevOps CI

The open-source React.js Autonomous LLM Agent

Text generator is a handy plugin for Obsidian that helps you generate text content using GPT-3 (OpenAI).

OpenAI API client for Kotlin with multiplatform and coroutines capabilities.

A simple, performant and scalable Jax LLM!

Knowledge is a tool for saving, searching, accessing, exploring and chatting with all of your favorite websites, documents and files.

Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.

Must-read Papers on LLM Agents.

The official GitHub page for the survey paper “A Survey on Evaluation of Large Language Models”.

The framework for fast development and deployment of RAG systems.

Index and query any data using LLM and natural language, tracking sources and showing citations.

LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.

《构筑大语言模型应用:应用开发与架构设计》一本关于 LLM 在真实世界应用的开源电子书,介绍了大语言模型的基础知识和应用,以及如何构建自己的模型。其中包括Prompt的编写、开发和管理,探索最好的大语言模型能带来什么,以及LLM应用开发的模式和架构设计。

llm-chain is a powerful rust crate for building chains in large language models allowing you to summarise text and complete complex tasks

Run AI-powered workflows over your codebase

A collection of reference Jupyter notebooks and demo AI/ML applications for enterprise use cases: marketing, pricing, supply chain, smart manufacturing, and more.

RayLLM - LLMs on Ray

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

收集 CVPR 最新的成果,包括论文、代码和demo视频等,欢迎大家推荐!Collect the latest CVPR (Conference on Computer Vision and Pattern Recognition) results, including papers, code, and demo videos, etc., and welcome recommendations from everyone!

llama.go is like llama.cpp in pure Golang!

LLMs for your CLI

Explore large language models in 512MB of RAM

An elegent pytorch implement of transformers

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数,训练数据,评估数据,评估方法。

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL

The TypeScript framework for AI development

🚀🚀🚀 A collection of some awesome public YOLO object detection series projects.

All-in-one infrastructure for building search, recommendations, and RAG. Trieve combines search language models with tools for tuning ranking and relevance.

Local CLI Copilot, powered by CodeLLaMa. 💻🦙

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案

Awesome things about LLM-powered agents. Papers / Repos / Blogs / …

how to optimize some algorithm in cuda.

The Gemini API Python SDK enables developers to use Google’s state-of-the-art generative AI models to build AI-powered features and applications.

Codes for “Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models”.

Fast, private, controllable pre-trained models in Rust

Open-source tool to visualise your RAG 🔮

《AI 研发提效研究:自己动手训练 LoRA》,包含 Llama (Alpaca LoRA)模型、ChatGLM (ChatGLM Tuning)相关 Lora 的训练。训练内容:用户故事生成、测试代码生成、代码辅助生成、文本转 SQL、文本生成代码……

Auto-GPT on the browser

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

🦙 Integrating LLMs into structured NLP pipelines

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

LangChain powered shell command generator and runner CLI

Efficient Retrieval Augmentation and Generation Framework

LLM Prompt Injection Detector

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Text-To-Speech, RAG, and LLMs. All local!

Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A

ChatGPT, GenerativeAI and LLMs Timeline

Ship RAG based LLM web apps in seconds.

Traditional Mandarin LLMs for Taiwan

local-first semantic code search engine

[Paper List] Papers integrating knowledge graphs (KGs) and large language models (LLMs)

Inference code and configs for the ReplitLM model family

🦜⛏️ Did you say you like data?

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

The Security Toolkit for LLM Interactions

Compare the performance of different LLM that can be deployed locally on consumer hardware. Run yourself with Colab WebUI.

Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone

LLM vulnerability scanner

A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere) using AWS CDK on AWS

The production toolkit for LLMs. Observability, prompt management and evaluations.

The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.

A lightweight framework for building LLM-based agents

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

AIConfig is a config-based framework to build generative AI applications.

A curated list for Efficient Large Language Models

Serving multiple LoRA finetuned LLM as one

Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.

Tensor parallelism is all you need. Run LLMs on weak devices or make powerful devices even more powerful by distributing the workload and dividing the RAM usage.

《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。

🤖 A PyTorch library of curated Transformer models and their composable components

A curated list of language modeling researches for code and related datasets.

📊 llm.report is an open-source logging and analytics platform for OpenAI: Log your ChatGPT API requests, analyze costs, and improve your prompts.

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops

A curated list of awesome resources, tools, and other shiny things for GPT prompt engineering.

🌟 ChatGenTitle:使用百万arXiv论文信息在LLaMA模型上进行微调的论文题目生成模型

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.

[ACL 2023] Reasoning with Language Model Prompting: A Survey

Data and tools for generating and inspecting OLMo pre-training data.

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.

日本語LLMまとめ - Overview of Japanese LLMs

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

Survey: A collection of AWESOME papers and resources on the large language model (LLM) related recommender system topics.

中文法律LLaMA (LLaMA for Chinese legel domain)

ChatGPT and Bing AI prompt curation

What would you do with 1000 H100s…

🔒 Enterprise-grade API gateway that helps you monitor and impose cost or rate limits per API key. Get fine-grained access control and monitoring per user, application, or environment. Supports OpenAI, Azure OpenAI, Anthropic, vLLM, and open-source LLMs.

Collection of leaked system prompts

A toolkit for inference and evaluation of ‘mixtral-8x7b-32kseqlen’ from Mistral AI

Pure Rust implementation of a minimal Generative Pretrained Transformer

Scale LLM Engine public repository

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper “Retrieval-Augmented Generation for AI-Generated Content: A Survey”.

One API for all LLMs either Private or Public (Anthropic, Llama V2, GPT 3.5/4, Vertex, GPT4ALL, HuggingFace …) 🌈🐂 Replace OpenAI GPT with any LLMs in your app with one line.

An LLM-powered advanced RAG pipeline built from scratch

Full stack framework for building cross-platform mobile AI apps

SkyAGI: Emerging human-behavior simulation capability in LLM

LM Studio CLI. Written in TypeScript/Node

A fast and lightweight framework for creating decentralized agents with ease.

The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set defaul openai key. If keys exceed plan and are invalid, please tell us. The response speed depends on openai. ( sometimes, the official is too crowded and slow)

Unofficial API for Claude-2 via Claude Web (Also CLI)

PhoGPT: Generative Pre-training for Vietnamese (2023)

Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。

Practical course about Large Language Models.

LLM powered fuzzing via OSS-Fuzz.

Salesforce open-source LLMs with 8k sequence length.

A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

Agents Capable of Self-Editing Their Prompts / Python Code

明医 (MING):中文医疗问诊大模型

A curation of awesome tools, documents and projects about LLM Security.

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.

An open-source, high-performance SQL vector database built on ClickHouse.

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

SEAIT is a user-friendly application that simplifies the installation process of AI-related projects

Make Llama2 use Code Execution, Debug, Save Code, Reuse it, Access to Internet

Nvidia GPU exporter for prometheus using nvidia-smi binary

Run AI models locally on your machine with node.js bindings for llama.cpp. Force a JSON schema on the model output on the generation level

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

🚀 大语言模型高效转发服务 · An efficient forwarding service designed for LLMs. · OpenAI API Reverse Proxy

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

Chat with your documents offline using AI.

DriveLM: Driving with Graph Visual Question Answering

Easy-to-use headless React Hooks to run LLMs in the browser with WebGPU. Just useLLM().

Python & JS/TS SDK for adding code interpreting to your AI app

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Prompt engineering for developers

a lightweight LLM model inference framework

Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

Make your own story. User-friendly software for LLM roleplaying

中文Mixtral-8x7B(Chinese-Mixtral-8x7B)

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.

A LLM semantic caching system aiming to enhance user experience by reducing response time via cached query-result pairs.

List of language agents based on paper “Cognitive Architectures for Language Agents”

LLMFlows - Simple, Explicit and Transparent LLM Apps

An open-source educational chat model from ICALK, East China Normal University. 开源中英教育对话大模型。(通用基座模型,GPU部署,数据清理) 致敬: LLaMA, MOSS, BELLE, Ziya, vLLM

Large Language Models (LLMs) tutorials & sample scripts, ft. langchain, openai, llamaindex, gpt, chromadb & pinecone

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

🗣️ Chat with LLM like Vicuna totally in your browser with WebGPU, safely, privately, and with no server. Powered by web llm.

A database of SDKs, frameworks, libraries, and tools for creating, monitoring, debugging and deploying autonomous AI agents

A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese

Rate limiting, caching, and request prioritization for modern workloads

A life simulation Game powered by GPT-4’s “Advanced Data Analysis” function , offering you a second chance at life. 由GPT4的Advanced Data Analysis功能驱动的人生重来模拟器,给您人生第二春。

The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim and Neovim.

AWS-native chatbot using Bedrock + Claude

Using Tree-of-Thought Prompting to boost ChatGPT’s reasoning

Automata: A self-coding agent

Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.

Active learning for systematic reviews

Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks

Unified Reinforcement Learning Framework

A lightweight AI Agent

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.

[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies

A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search.

Best practice for training LLaMA models in Megatron-LM

Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.

AI画图。每天一句中国古诗词,生成 AI 图片 Powered by Bing DALL-E-3.

Leverage hallucinations from Large Language Models (LLMs) for novelty-driven explorations.

利用LLM构建应用实践笔记

Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.

WrenAI makes your database RAG-ready. Implement Text-to-SQL more accurately and securely.

🎒 local.ai - Run AI locally on your PC!

MobiLlama : Small Language Model tailored for edge devices

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

automatically tests prompt injection attacks on ChatGPT instances

QiZhenGPT: An Open Source Chinese Medical Large Language Model|一个开源的中文医疗大语言模型

A framework to evaluate the generalization capability of safety alignment for LLMs

Automatically create prompts and make them fight each other to know which is the best

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

LLM-based ontological extraction tools, including SPIRES

中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)

Play LLaMA2 (official / 中文版 / INT4 / llama2.cpp) Together! ONLY 3 STEPS! ( non GPU / 5GB vRAM / 8~14GB vRAM)

Command-line interface for ChatGPT, Claude and Bard

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy’s makemore :)

Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.

Build a chatbot or Q&A bot of your website’s content

[ICML'24] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

🪿 LinGoose is a Go framework for building awesome AI/LLM applications.

Official implementation of Half-Quadratic Quantization (HQQ)

Java version of LangChain, while empowering LLM for Big Data.

CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资产(数据集、模型文件、代码等)。CSGHub提供类似私有化的Huggingface功能,以类似OpenStack Glance管理虚拟机镜像、Harbor管理容器镜像以及Sonatype Nexus管理制品的方式,实现对LLM资产的管理。欢迎关注反馈和Star⭐️

Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMs

Large Language Model-enhanced Recommender System Papers

This repository contains a collection of papers and resources on Reasoning in Large Language Models.

Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI]

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

  • r2d4/rellm : ✩492 ✎Python ⏱2023-08-10 00:00:00

Exact structure out of any language model completion.

tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.

Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 40+ HF models, 20+ benchmarks

Framework that joins data models, schemas, code generation, and a task engine. Language and technology agnostic.

Talk to any ArXiv paper using ChatGPT

A realtime and indexing and structured extraction engine for Unstructured Data to build Generative AI Applications

Azure OpenAI (demos, documentation, accelerators).

Synmetrix – open source semantic layer / Boost your LLM precision

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

[DEPRECATED] Powered by AI Playgrounds by E2B. Code interpreter on steroids for ChatGPT. Run any language, any terminal process, use filesystem freely. All with access to the internet.

🐋MindChat(漫谈)——心理大模型:漫谈人生路, 笑对风霜途

🚀🚀🚀A collection of some awesome public projects about Large Language Model, Vision Foundation Model and AI Generated Content.

RAG for Local LLM, chat with PDF/doc/txt files, ChatPDF

A simple “Be My Eyes” web app with a llama.cpp/llava backend

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

  • nluxai/nlux : ✩457 ✎TypeScript ⏱2024-05-25 00:00:00

The 𝗣𝗼𝘄𝗲𝗿𝗳𝘂𝗹 Conversational AI JavaScript Library

Elixir implementation of a LangChain style framework.

Self-evaluating interview for AI coders

A reading list for large models safety, security, and privacy.

🤖 A prototype assistant for writing and thinking

Generate textbook-quality synthetic LLM pretraining data

  • BigJk/snd : ✩447 ✎TypeScript ⏱2024-04-19 00:00:00

Sales & Dungeons — Thermal Printer as D&D / TTRPG Utility

Large language model evaluation and workflow framework from Phase AI.

Repo for fine-tuning Casual LLMs

An open collection of implementation tips, tricks and resources for training large language models

🛠️ User-friendly bash script for setting up and configuring your LocalAI server with the GPT4All for free! 💸

A tiny implementation of an autonomous agent powered by LLMs (OpenAI GPT-4)

A complete guide to start and improve your LLM skills in 2024 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!

PULSE: Pretrained and Unified Language Service Engine

LLMs and Machine Learning done easily

Are Copilots Local Yet? The frontier of local LLM Copilots for code completion, project generation, shell assistance, and more. Find tools shaping tomorrow’s developer experience, today!

Build AI WhatsApp Bots with Pure Python

Go SDK for Google Generative AI

The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.

Large Language Models for All, 🦙 Cult and More, Stay in touch !

心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3

Generative Representational Instruction Tuning

An intuitive approach to building with LLMs

AI Vim/Neovim code generation plugin (OpenAI, ChatGPT, and more)

Run AI ✨ assistant locally! with simple API for Node.js 🚀

[ACL 2024] Progressive LLaMA with Block Expansion.

An open collection of methodologies to help with successful training of large language models.

An AI code interpreter for sensitive data, powered by GPT-4 or Code Llama / Llama 2.

Language modeling and instruction tuning for Russian

Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.

Terminal UI to chat with large language models (LLM) using different model backends, and integrations with your favourite editors!

Video Search and Streaming Agent 🕵️‍♂️

structured extraction for llms

🚀WebUI integrated platform for latest LLMs | 各大语言模型的全流程工具 WebUI 整合包。支持主流大模型API接口和开源模型。支持知识库,数据库,角色扮演,mj文生图,LoRA和全参数微调,数据集制作,live2d等全流程应用工具

🦋 Production grade chaining engine behind TuneChat. Self host today!

A localized open-source AI server that is better than ChatGPT.

pykoi: Active learning in one unified interface

Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface

C# implementation of LangChain. We try to be as close to the original as possible in terms of abstractions, but are open to new entities.

🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.

轻松玩转LLM兼容openai&langchain,支持文心一言、讯飞星火、腾讯混元、智谱ChatGLM等

Querying local documents, powered by LLM

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

RTP-LLM: Alibaba’s high-performance LLM inference engine for diverse applications.

INSIGHT is an autonomous AI that can do medical research!

freeGPT provides free access to text and image generation models.

Inference Llama 2 in one file of pure Python

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

Experience the power of Clarifai’s AI platform with the python SDK. 🌟 Star to support our work!

Bridging LLM and Recommender System.

Smoothly Manage Multiple LLMs (OpenAI, Anthropic, Azure) and Image Models (Dall-E, SDXL), Speed Up Responses, and Ensure Non-Stop Reliability.

A program synthesis agent that autonomously fixes its output by running tests!

A fast and minimal framework for building agent-integrated systems

🕵️‍♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach.

Create characters in Unity with LLMs!

Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)

Grounded Multimodal Large Language Model with Localized Visual Tokenization

Agent Cloud is like having your own GPT builder with a bunch extra goodies. The GUI features 1) RAG pipeline which can natively embed 260+ datasources 2) Create Conversational apps (like GPTs) 3) Create Multi Agent process automation apps (crewai) 4) Tools 5) Teams+user permissions. Get started fast with Docker and our install.sh

alpaca中文指令微调数据集

Simple go utility to download HuggingFace Models and Datasets

Directly Connecting Python to LLMs via Strongly-Typed Functions, Dataclasses, Interfaces & Generic Types

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Reflexion: an autonomous agent with dynamic memory and self-reflection

Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型

多模态中文LLaMA&Alpaca大语言模型(VisualCLA)

Control Any Computer Using LLMs

ROS-LLM is a framework designed for embodied intelligence applications in ROS. It allows natural language interactions and leverages Large Language Models (LLMs) for decision-making and robot control. With an easy configuration process, this framework allows for swift integration, enabling your robot to operate with it in as little as ten minutes.

llama.cpp with BakLLaVA model describes what does it see

LLM (Large Language Model) FineTuning

🚀 智谱清言 ChatGLM4大模型逆向API白嫖测试【特长:超强智能体】,支持高速流式输出、支持智能体对话、支持多轮对话、支持AI绘图、支持联网搜索、支持长文档解读、支持代码调用、支持图像解析,零配置部署,多路token支持,自动清理会话痕迹。

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

LangStream. Event-Driven Developer Platform for Building and Running LLM AI Apps. Powered by Kubernetes and Kafka.

A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD)

  • r2d4/openlm : ✩361 ✎Python ⏱2023-05-19 00:00:00

OpenAI-compatible Python client that can call any LLM

Code and datasets for “Character-LLM: A Trainable Agent for Role-Playing”

A collection of one-click self-hosted AI

🦖 X—LLM: Cutting Edge & Easy LLM Finetuning

AgentLLM is a PoC for browser-native autonomous agents

Edge full-stack LLM platform. Written in Rust

Your Engineering Gen AI Team member 🧬🤖💻

Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, autonomous agents, code and command execution, file upload and download, speech synthesis and recognition, access to Web, memory, prompt presets, plugins, assistants & more. Linux, Windows, Mac.

MindSpore online courses: Step into LLM

Low latency JSON generation using LLMs ⚡️

Chat with and teach your calendar to solve your scheduling & time problems

A curated list of awesome LLM agents.

A collection of apps powered by the LangChain LLM framework.

Quick Start LLaMA models with multiple methods, and fine-tune 7B/65B with One-Click.

A python module to repair invalid JSON, commonly used to parse the output of LLMs

👀 What LLM to use?

Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.

🚀 阿里通义千问2.5大模型逆向API白嫖测试【特长:六边形战士】,支持高速流式输出、无水印AI绘图、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。

すぐに業務活用できるビジネスユースケース集付きの安全な生成AIアプリ実装

This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as Flan-T5.

The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.

Research Trends in LLM-guided Multimodal Learning.

Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS

VectorHub is a free, open-source learning website for people (software developers to senior ML architects) interested in adding vector retrieval to their ML stack.

LLaMa/RWKV onnx models, quantization and testcase

LLMs Best Tricks

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.

Conversations with your files! Manage and run your AI presets!

An easy way to host your own AI API and expose alternative models, while being compatible with “open” AI clients.

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

A framework for creating grounded instruction based datasets and training conversational domain expert Large Language Models (LLMs).

appbuilder-sdk, 千帆AppBuilder-SDK帮助开发者灵活、快速的搭建AI原生应用

A library for easily merging multiple LLM experts, and efficiently train the merged LLM.

A model compilation solution for various hardware

ERNIE Bot Agent is a Large Language Model (LLM) Agent Framework, powered by the advanced capabilities of ERNIE Bot and the platform resources of Baidu AI Studio.

JS tokenizer for LLaMA 1 and 2

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

Interface to ChatGPT from R

Production-Ready LLM Agent SDK for Every Developer

⛓️ build cognitive systems, pythonic

A high-performance inference system for large language models, designed for production environments.

Paper List for Recommend-system PreTrained Models

Framework to easily create LLM powered bots over any dataset.

活字通用大模型

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.

This is suite of the hands-on training materials that shows how to scale CV, NLP, time-series forecasting workloads with Ray.

EdgeChains.js Typescript/Javascript production-friendly Generative AI. Based on Jsonnet. Works anywhere that Webassembly does. Prompts live declaratively & “outside code in config”. Kubernetes & edge friendly. Compatible with OpenAI GPT, Gemini, Llama2, Anthropic, Mistral and others

Domain Adapted Language Modeling Toolkit - E2E RAG

Neovim plugin for interacting with LLM’s and building editor integrated prompts.

This repository collects papers for “A Survey on Knowledge Distillation of Large Language Models”. We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.

data-to-paper: Backward-traceable AI-driven scientific research

⚡ Edgen: Local, private GenAI server alternative to OpenAI. No GPU required. Run AI models locally: LLMs (Llama2, Mistral, Mixtral…), Speech-to-text (whisper) and many others.

Jlama is a modern Java inference engine for LLMs

Galah: LLM-powered web honeypot!

PlotAI - Your Ultimate Plotting Assistant! 📊🤖 Use ChatGPT-3.5 to create plots in Python and Matplotlib directly in your Python script or notebook.

Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs

This chatbot app is built using the Llama 2 open source LLM from Meta.

GLake: optimizing GPU memory management and IO transmission.

Repo for paper “Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration”

A collection of AWESOME things about HUGE AI models.

LLM powered retrieval engine designed to process a ton of sources to collect a comprehensive list of entities.

📎🟢 Like Clippy but for the CLI. A blazing fast AI helper for your command line

Starter App to Build Your Own App to Query Doc Collections with Large Language Models (LLMs) using LlamaIndex, Langchain, OpenAI and more (MIT Licensed)

Examples and guides for using the E2B API

ChatGPT at home! Basically a better Google Nest Hub or Amazon Alexa home assistant. Built on the Raspberry Pi using the OpenAI API.

The “vicuna-installation-guide” provides step-by-step instructions for installing and configuring Vicuna 13 and 7B

🚀 秘塔AI搜索逆向API白嫖测试【特长:超强检索超长输出】,支持高速流式输出、超强联网搜索(全网or学术以及简洁、深入、研究三种模式),零配置部署,多路token支持

🔥 Building composable LLM applications & workflow with Java.

ArXiv Digest and Personalized Recommendations using Large Language Models

Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas

本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。

SOTA Math Opensource LLM

AWS Generative AI CDK Constructs are sample implementations of AWS CDK for common generative AI patterns.

Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning

A tool for generating function arguments and choosing what function to call with local LLMs

An extensive library of AI resources including books, courses, papers, guides, articles, tutorials, notebooks, AI field advancements and more.

📦 CodeBox is the simplest cloud infrastructure for your LLM Apps and Services.

A copilot for your terminal

🦜️🔗LangChain for Rust, the easiest way to write LLM-based programs in Rust

AIDE: the Machine Learning CodeGen Agent

Rill Flow is a high-performance, scalable workflow orchestration engine for distributed workloads and LLMs

Text2Text: Crosslingual NLP/G toolkit

Chat with documents (pdf, docx, txt) using ChatGPT and Langchain

🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

Your Self-Hosted, Hackable Research Agent Inspired by AutoGPT

Extract knowledge from all information sources using gpt and other language models. Index and make Q&A session with information sources.

LLM Zoo collects information of various open- and close-sourced LLMs

A joint community effort to create one central leaderboard for LLMs.

“Awesome-LLM: a curated list of Azure OpenAI & Large Language Models” 🔎References to Azure OpenAI, 🦙Large Language Models, and related 🌌 services and 🎋libraries.

Joint speech-language model - respond directly to audio!

Make your GenAI Apps Safe & Secure 🚀 Test & harden your system prompt

Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.

scalable and robust tree-based speculative decoding algorithm

This repo contains evaluation code for the paper “MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI”

[CVPR'24 Highlight] GPT4Point: A Unified Framework for Point-Language Understanding and Generation.

A collection of Deep Learning projects and resources

Open-source backend for building stateful AI apps

A Repo to store the Google Colaboratory Notebooks that I have created and shared

Fact-checking LLM outputs with self-ask

Multi-node production AI stack. Run the best of open source AI easily on your own servers. Create your own AI by fine-tuning open source models. Integrate LLMs with APIs. Run gptscript securely on the server

Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.

A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.

🌿孙思邈中文医疗大模型(Sunsimiao):提供安全、可靠、普惠的中文医疗大模型

Papers and resources related to the security and privacy of LLMs 🤖

Must-read Papers on Large Language Model (LLM) Planning.

🐳 Aurora is a [Chinese Version] MoE model. Aurora is a further work based on Mixtral-8x7B, which activates the chat capability of the model’s Chinese open domain.

Phi3 中文仓库

Official implementation of BGPT @ ICLR 2024 paper “Cumulative Reasoning With Large Language Models” (https://arxiv.org/abs/2308.04371 )

OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)

The Tree of Thoughts (ToT) framework for solving complex reasoning tasks using LLMs

A curated list of awesome instruction tuning datasets, models, papers and repositories.

Powerful Intelligent Workflow Automation

Papers and Book to look at when starting AGI 📚

AIxplora is a open-source tool which let’s you query all kind of files not limited to any length or format.

Official Implementation of ICLR 2024 paper: “Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning”

LLM.swift is a simple and readable library that allows you to interact with large language models locally with ease for macOS, iOS, watchOS, tvOS, and visionOS.

GenSim: Generating Robotic Simulation Tasks via Large Language Models

An efficient API documentation management tool that fully adheres to the OpenAPI specification and incorporates advanced LLM technology. This tool can automate the generation of API documentation, data models, and test cases, greatly enhancing development efficiency and documentation quality.

WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/ )

Platform for General Robot Intelligence Development

This is meant to be a comperehensive list of ChatGPT plugins and plugin-related resources.

EvalGPT is an code interpreter framework that utilizes large language models to automate the process of code-writing and execution, delivering precise results for user-defined tasks.

🚀Large language model automation and Autonomous Language Agents development framework. Build your LLM Agent Application in a pythonic way!

Paper List of Pre-trained Foundation Recommender Models

A CLI and web UI to interact with LLMs in a Chat-style interface, with code execution capabilities and other tools.

Production-ready Generative AI for local, cloud native, airgap, and edge deployments.

0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of “BitNet: Scaling 1-bit Transformers for Large Language Models” in pytorch with Llama(2) Architecture

Summarizes top stories from Hacker News using a large language model and post them to a Telegram channel.

FREE ML Courses from Top Universities in CS

🚀 LangChain for Swift. Optimized for iOS, macOS, watchOS (part) and visionOS.(beta)

Specify a github or local repo, github pull request, arXiv or Sci-Hub paper, Youtube transcript or documentation URL on the web and scrape into a text file and clipboard for easier LLM ingestion

Clipboard Conqueror is a novel copy and paste copilot alternative designed to bring your very own LLM AI assistant to any text field.

Enhance LLM agents with versatile tool APIs

Enforce structured output from LLMs 100% of the time

Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It leverages the power of LLM to generate responses and uses vector databases to fetch relevant documents to enhance the quality and relevance of the output.

ICML 2024: Less: Selecting Influential Data for Targeted Instruction Tuning

A recipe to train reward models for RLHF.

Provide best practices for LMOps, as well as elegant and convenient access to the features of the Qianfan MaaS Platform. (提供大模型工具链最佳实践,以及优雅且便捷地访问千帆大模型平台)

A Test Project for a Network Security-oriented LLM Tool Emulating AutoGPT

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

The open-source SDK for creating AI plugins and actions

Conversational AI Platform to build effective Proactive Digital Assistants using Visual LLM Chaining

The open-source serverless GPU container runtime.

Guideline following Large Language Model for Information Extraction

StarWhisper:LLM for Astronomy

An AI-powered arXiv paper summarization website with a virtual assistant for answering questions.

A full-stack Webui implementation of Large Language model, such as ChatGPT or LLaMA.

♾️ toolkit for air-gapped LLMs on consumer-grade hardware

(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

Official Repo for ICML 2024 paper “Executable Code Actions Elicit Better LLM Agents” by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.

从0到1构建一个MiniLLM

An Awesome Collection for LLM Survey

语言模型中文认知能力分析

Let the LLM review your code.

Integrated AI environment in the terminal. Build, test and instruct agents.

💭 Retrieval augmented generation (RAG) and language model powered search applications

FireAct: Toward Language Agent Fine-tuning

an intro to retrieval augmented large language model

LLM-powered Markdown editor

syntactic sugar 🍭 for langchain

✨ A beautiful UI for ChatGPT and other conversational models

Official Code of “STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases”

[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks

[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

A truly natural scripting language

🤖🌊 aiFlows: The building blocks of your collaborative AI

VSCode extension of Quack Companion 💻 Turn your team insights into a portable plug-and-play context for code generation. Alternative to GitHub Copilot powered by OSS LLMs (Mistral, Gemma, etc.), served with Ollama.

🤖 Awesome list of AGI Agents. Agents 精选资源合集.

A set of lessons aimed at anyone learning LLM and generative AI concepts, with sections on operations and security, as well as development.

AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questions

多维度中文对齐评测基准 | Benchmarking Chinese Alignment of LLMs

Use GPT-4 to stream diagrams, instead of tokens, in real-time! (UIST 2023 Paper)

比做算法的懂工程落地,比做工程的懂算法模型。

🔐Free GPT-3.5 chat with your docs (PDF, WORD, CSV, TXT)

Lightrail Monorepo

Run Alpaca LLM in LangChain

🧠 AI-powered CLI git wrapper, boilerplate code generator, chat history manager, and code search engine to streamline your dev workflow 🌊

Official implementation of TransNormerLLM: A Faster and Better LLM

MyGirl GPT is a project to build your own AI girlfriend Running on Your Personal Server with local LLM.

An LLM-based agent that will install the tools it needs.

Create-tsi is a generative AI RAG toolkit which generates AI Applications with low code.

Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset

A list of top AI agents

[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Framework to bring LLM applications to production

Continuous Integration for LLM powered applications

The code of our paper “InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory”

Large Language Model guided Protocol Fuzzing (NDSS'24)

A fully in-browser privacy solution to make Conversational AI privacy-friendly

⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.

Practical Llama 3 inference in Java

Download, Edit, and Generate Videos, Images and Sounds, in bulk using AI

Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.

🤖 Free Search with AI, 💡 Open-Source Perplexity, 📍 Support Ollama/SearXNG, Support Docker deployment. 让AI大模型和搜索引擎回答你的问题,支持本地大模型(Ollama)、聚合搜索引擎SearXNG,支持Docker一键部署。

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Here you can find all the Tutorials for Haystack 📓

An AI framework for building cool things.

Helping Ethical Hackers use LLMs in 50 Lines of Code or less..

Easy token price estimates for LLMs

Examples on how to use LangChain and Ray

GGUF implementation in C as a library and a tools CLI program

Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside Langchain or other agents. Contains Oobagooga and KoboldAI versions of the langchain notebooks with examples.

Ruby on Rails 7-based ChatGPT Bot Platform

PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections, complete with options for text validation and hallucination filtering.

Convenience scripts to finetune (chat-)LLaMa3 and other models for any language

An implementation of bucketMul LLM inference

Autonomous Agents (LLMs) research papers. Updated Daily.

An innovative open-source Code Interpreter with (GPT,Gemini,Claude,LLaMa) models.

A simple and declarative way to control LLMs

  • r2d4/llm.ts : ✩203 ✎TypeScript ⏱2023-05-09 00:00:00

Call any LLM with a single API. Zero dependencies.

LLM全栈优质资源汇总

Humanable Chat Generative-model Fine-tuning | LLM微调

UP-TO-DATE LLM Watermark paper. 🔥🔥🔥

Tune LLM in few lines of code

LLM-powered NPCs running on your hardware

BAML is a configuration file format to write better and cleaner LLM functions. Check out the promptfiddle.com playground

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna

Step by step explanation/tutorial of llama2.c

  • l3vels/L3AGI : ✩196 ✎TypeScript ⏱2024-04-18 00:00:00

Open-source framework to make AI agents' team collaboration as effective as human collaboration.

End-to-end platform for building voice first multimodal agents

an MLOps/LLMOps platform

🦜💯 Flex those feathers!

LLM verified with Monte Carlo Tree Search

Repository of instructions for Programming-specific GPT models

  • da-z/mlx-ui : ✩194 ✎Python ⏱2024-05-18 00:00:00

A simple UI / Web / Frontend for MLX mlx-lm using Streamlit.

[ICLR 2024 (Spotlight)] “Frozen Transformers in Language Models are Effective Visual Encoder Layers”

[ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets

Neuralink for your AI Agents - LangChain - Autogen - CrewAI

GenAI powered OpenSource IDE for API first workflows

Joplin (note-taking) assistant running a very intelligent system (OpenAI/GPT, Hugging Face, Gemini, Llama, Universal Sentence Encoder, etc.)

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

A OpenAI API compatible REST server for llama.

A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG

Learning to Drive with GPT

Provide Efficient LLM Fine-Tune via Multi-LoRA Optimization

The best free SaaS base for GenAI, LLM, and other AI Services. Ship your next Generative AI startups in days.

gpt-code-assistant is an open-source coding assistant leveraging language models to search, retrieve, explore and understand any codebase.

Dropbox LLM Security research code and results

Benchmarking Legal Knowledge of Large Language Models

The Open Assistant API is a ready-to-use, open-source, self-hosted agent/gpts orchestration creation framework, supporting customized extensions for LLM, RAG, function call, and tools capabilities. It also supports seamless integration with the openai/langchain sdk.

ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.

🦭 Full stack AI voice chatbot (speech-to-text, LLM, text-to-speech) with integrations to Auth0, OpenAI, Google Cloud and Stripe - Web App, Web API and AI API

We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.

🦀 A curated list of Rust tools, libraries, and frameworks for working with LLMs, GPT, AI

LLaMA-2 in native Go

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

LLMOps with Prompt Flow is a “LLMOps template and guidance” to help you build LLM-infused apps using Prompt Flow. It offers a range of features including Centralized Code Hosting, Lifecycle Management, Variant and Hyperparameter Experimentation, A/B Deployment, reporting for all runs and experiments and so on.

Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-context learning, visual reasoning, foundational models, and more. Stay updated with the latest advancement.

LM Studio TypeScript SDK

Chat with your data privately using MPT-30b

SingularGPT is a open source project that automates your device using ChatGPT & GPT-4.

The React library for LLMs

ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model

金融财报问答大模型LLM

Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.

A GUI application to easily try out Facebook’s LLaMA models.

ComPromptMized: Unleashing Zero-click Worms that Target GenAI-Powered Applications

A voice-enabled chatbot application built using of 🦜️🔗 LangChain, text-to-speech, and speech-to-text models from 🤗 Hugging Face, and 🍱 BentoML.

Chocolate Factory is a cutting-edge LLM toolkit designed to empower you in creating your very own AI assistant.Chocolate Factory 是一款开源的 LLM 应用开发框架,旨在帮助您轻松打造强大的软件开发 SDLC + LLM 生成助手。无论您是需要生成前端页面、后端 API、SQL 图表,还是测试用例数据,Chocolate Factory 都能满足您的需求。

Langchain realworld examples in JS

  • iflytek/VLE : ✩175 ✎Python ⏱2023-03-13 00:00:00

VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)

[CVPR 2024] “LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning”; an interactive Large Language 3D Assistant.

A curated list of MLSecOps tools, articles and other resources on security applied to Machine Learning and MLOps systems.

🧠 LLMFuzzer - Fuzzing Framework for Large Language Models 🧠 LLMFuzzer is the first open-source fuzzing framework specifically designed for Large Language Models (LLMs), especially for their integrations in applications via LLM APIs. 🚀💥

🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.

QA based on local knowledge and LLM.

A Python application which interacts with the llama.cpp library to provide virtual assistant capabilities through the command line. It allows you to ask questions and receive intelligent responses, as well as generate Linux commands based on your prompts.

Source Code of Skyrim’s Inworld AI Mod. Contains C++, Papyrus and Typescript code

A Language Agent for Autonomous Driving

React Native binding of llama.cpp

LangChain beginner’s tutorial for javascript/typescript

A collection of sample programs, notebooks, and tools which highlight the power of the MAX Platform

Conversational AI tooling & personas built on Cohere’s LLMs

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点对商品进行解说并激发用户的购买意愿的卖货主播模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊

A paper & resource list of large language models, including course, paper, demo, figures

LLaVA server (llama.cpp).

LangChain x gpt4free

Fast & more realistic evaluation of chat language models. Includes leaderboard.

RAG orchestration framework ⛵️

Redis Vector Library (RedisVL) interfaces with Redis' vector database for realtime semantic search, RAG, and recommendation systems.

Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible for beginners and as seamlesscustomizable and as possible for advanced users.

Building applications with LLMs through composability, in Kotlin, Scala, …

Semantic layer on top of a graph database to provide an LLM with a set of robust tools to interact with the database

The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.

Fast ChatGPT UI with support for both OpenAI and Azure OpenAI. 快速的ChatGPT UI,支持OpenAI和Azure OpenAI。

Curated collection of AI dev tools from YC companies, aiming to serve as a reliable starting point for LLM/ML developers

  • chieapp/chie : ✩162 ✎TypeScript ⏱2023-08-27 00:00:00

An extensive desktop app for ChatGPT and other LLMs.

[ACL 2024] Benchmarking the Hallucination of Chinese Large Language Models via Unconstrained Generation

A web UI Project In order to learn the large language model. This project includes features such as chat, quantization, fine-tuning, prompt engineering templates, and multimodality.

The RunPod worker template for serving our large language model endpoints. Powered by vLLM.

Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.

The official TypeScript/Node client for the Pinecone vector database

WebAssembly binding for llama.cpp - Enabling in-browser LLM inference

An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer

AI Native database for embedding vectors

MQL tool is designed to generate SQL queries directly from natural language inputs.

Long-Novel-GPT是一个基于GPT等大语言模型的长篇小说生成器。它采用层次化的大纲/章节/正文结构,以把握长篇小说的连贯剧情,通过上下文管理优化API调用成本,并根据自身或用户反馈不断进行优化,直至达到预定目标。

Python package that provides (unofficial) API access to the GPT-4 through chat.openai.com. Works with langchain. Supports browsing, DALL-E 3, plugins, continuing generation.

LoRA and DoRA from Scratch Implementations

A collection of C# notebooks to get you started with Semantic Kernel quickly.

The creative suite for character-driven AI experiences.

VisualWebArena is a benchmark for multimodal agents.

Complex question answering in LLMs with enhanced reasoning and information-seeking capabilities.

Source code for the paper “Empowering LLM to use Smartphone for Intelligent Task Automation”

A conversational semi-autonomous developer assistant. AI pair programming without the copypasta.

The paper list of the review on LLMs in medicine - “Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review”.

Code and Data artifact for NeurIPS 2023 paper - “Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context”. multispy is a lsp client library in Python intended to be used to build applications around language servers.

RAG LLM Ops App for easy deployment and testing

A list of the most popular AI Topic repositories on GitHub based on the number of stars they have received.| AI相关主题Github仓库排名,每日自动更新。

ChatGPT Jailbreaks, GPT Assistants Prompt Leaks, GPTs Prompt Injection, LLM Prompt Security, Super Prompts, Prompt Hack, Prompt Security, Ai Prompt Engineering, Adversarial Machine Learning.

Quanta is an open-source CMS with ChatGPT and Social Media (Fediverse) features

An open-source project dedicated to build foundational large language model for natural science, mainly in physics, chemistry and material science.

Knowledge work automation with AI agents

LLM for Long Text Summary (Comprehensive Bulleted Notes) : Walkthrough and Model Rankings

Laravel CRM - PHP CRM - AI CRM - Metadata CRM. Iceburg CRM is Open Source, metadata driven, extendable, unlimited relationships, convertable modules, 29 default themes, light/dark themes. It also have optional AI Abilities Using AI describe your CRM and use AI to Create it.

In LangChain for LLM Application Development, you will gain essential skills in expanding the use cases and capabilities of language models in application development using the LangChain framework.

It’s like Auto-GPT met Brew. The easiest and fastest way to get started with AutoGPT with any backend of your choice & whatever plugins you may need

Chat to Compose Video

A template to create any LLM Inference Web Apps using Python only

聯發創新基地(MediaTek Research) 致力於研究基礎模型。我們將研究體現在適合繁體中文使用者的模型上,並在使用權許可的情況下,提供模型給學術界研究或產業界使用。

Social and customizable AI writing assistant! ✍️

Structured LLM APIs

Port of Facebook’s LLaMA (Large Language Model Meta AI) in Golang with embedded C/C++

LLM-based CLI utility for simulation worlds creation.

🤖 Cybersecurity Automation & Investigation Assistant

SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.

Embed arbitrary modalities (images, audio, documents, etc) into large language models.

Code and datasets for paper “K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization” in WSDM-2024

Run any Large Language Model behind a unified API

llama_cpp provides Ruby bindings for llama.cpp

Speakeasy CLI - Enterprise developer experience for your API

Fast Inference of MoE Models with CPU-GPU Orchestration

Immersive workshop showcasing the remarkable potential of integrating SoTA foundation models to enhance product experiences and streamline backend workflows. Leverages Microsoft’s Copilot stack, Semantic Kernel and Azure primitives to offer an engaging and comprehensive introduction to AI-infused app development and deployment

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future – PRs welcome).

A recipe to train reward models for RLHF.

Repository for Chat LLaMA - training a LoRA for the LLaMA (1 or 2) models on HuggingFace with 8-bit or 4-bit quantization. Research only.

LLM Frontend in a single html file

Power of AI agents and a new way of search for science and technology.

A voice chatbot based on GPT4All and talkGPT, running on your local pc!

LongQLoRA: Extent Context Length of LLMs Efficiently

Harnessing the Memory Power of the Camelids

Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorDBs and more.. Integrate using Typescript, Python. 🚀💻📊

the official implementation of the RecSys 2023 paper “Uncovering ChatGPT’s Capabilities in Recommender Systems”

GPT Interviewer - Practice interview with AI interviewer based on job descriptions and resume

Open-source software engineer

本项目开源基于NextJS的前端, 希望能够提供一个用于生成式AI的文字转视频, 尤其是电影从编剧到视频生成的Web前端平台参考。Everyone can become a director. The Nextjs front-end of an AI driven platform for automatic movie/video generation (form GPT script generation to text2video movie generation).这是一个免费试用AI视频创作平台,集成了基于GPT的视频剧本生成和视频生成功能。 我们的理想是让每个人都能成为导演,以最快的方式将日常中的任何创意转化为高质量的视频, 无论是电影、营销视频、还是自媒体视频。

🪶 Lightweight OpenAI drop-in replacement for Kubernetes

(2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Multilingual Medicine: Model, Dataset, Benchmark, Code

Fuji is an AI agent that lives in your browser’s sidepanel. You can now get tasks done online with a single command!

Build LLM-powered robots in your garage with MachinaScript For Robots!

Open-source framework that gives you AI Agents that help you navigate decision-making, get personalized goals and execute them

📚 A curated list of papers & technical articles on AI Quality & Safety

Optimise Shopping feeds with Generative AI

A companion guide for the blog post series, LangChain Decoded.

  • MxDkl/pls : ✩139 ✎Shell ⏱2024-05-21 00:00:00

CLI to convert natural language to terminal commands

Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖➕📚🟰❤️

  • npi-ai/npi : ✩138 ✎Python ⏱2024-05-24 00:00:00

Tool use APIs platform for AI Agent

Run Large-Language Models (LLMs) 🚀 directly in your browser!

Entire source code for NewsNotFound’s article generation process ✍

Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application

A few quick scripts focused on testing TensorFlow/PyTorch/Llama 2 on macOS.

ChatData 🔍 📖 brings RAG to real applications with FREE✨ knowledge bases. Now enjoy your chat with 6 million wikipedia pages and 2 million arxiv papers.

A sandbox repo for grounded question answering with Cohere and Google Search

KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents

🤖 Discover how to apply your LLM app skills on Kubernetes!

A fuzzy key value store based on semantic similarity rather lexical equality.

AI-to-AI Testing | Simulation framework for LLM-based applications

MarkLLM: An Open-Source Toolkit for LLM Watermarking.

XTTSv2 Extension for oobabooga text-generation-webui

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

✨ Local Zapier replacement written in Rust to make local AI do way more than chat

Efficient translation tool based on ChatGPT API

StyleLLM文风大模型:基于大语言模型的文本风格迁移项目。Text style transfer base on Large Language Model. #文字修饰 # 润色 #风格模仿

[ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts

The PDFChat app allows you to chat with your PDF files in natural language.

Finetune LLaMA-7B with Chinese instruction datasets

Gateway for secure & reliable communications with OpenAI and other LLM providers

LLM Testing SDK that helps you write and run tests to monitor your LLM app in production

🔥 React library of AI components 🔥

LingoWhale-8B: Open Bilingual LLMs | 开源双语预训练大模型

PHP Framework created for building web applications with AI and ML capabilities. Built-in Web Server and WebSocket server.

Run LLMs locally. A clojure wrapper for llama.cpp.

An extension for oobabooga/text-generation-webui that enables the LLM to search the web using DuckDuckGo

LLM-based text extraction from unstructured data like PDFs, Words and HTMLs. Transform and cluster the text into your desired format. Less information loss, more interpretation, and faster R&D!

🧁👋 Welcome to CupcakeAGI, where we bake up some sweet and creamy AGI goodness! 🍰🤖

🚀 跃问YueWen 多模态大模型逆向API白嫖测试【特长:超强多模态】,支持高速流式输出、联网搜索、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。

Inference Llama 2 in one file of pure Zig

✨ An elegant async Python wrapper for Google Gemini web app

Product analytics for AI Assistants

TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

LLM for Drug Editing, ICLR 2024

🌌️Fuzz4All: Universal Fuzzing with Large Language Models

🚀 DeepSeek-V2大模型逆向API白嫖测试【特长:GPT4平替】,支持高速流式输出、多轮对话,零配置部署,多路token支持。

⚡Chat with GitHub Repo Using 200k context window of Claude instead of RAG!⚡

Biomedical LLM, A Bilingual (Chinese and English) Fine-Tuned Large Language Model for Diverse Biomedical Tasks

Tiledesk’s open-source visual, no-code designer where LLM/GPT AI meets a flexible ‘graph’ approach. Create conversations and automations effortlessly – a Voiceflow and Botpress alternative.

Tooling backed by an LLM for performing natural language searches against compiled target binaries. Search for encryption logic, password strings, vulnerabilities, etc.

This open-source repository offers reference code for integrating workplace datastores with Cohere’s LLMs, enabling developers and businesses to perform seamless retrieval-augmented generation (RAG) on their own data.

Create knowledge graphs with LLMs

  • c-3lab/dim : ✩121 ✎TypeScript ⏱2024-01-21 00:00:00

📦 dim: Manage the open data in your project like a package manager.

Large Language Model (LLM) Inference API and Chatbot

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM

Next-token prediction in JavaScript — build fast language and diffusion models.

Typescript and .NET implementation of BPE tokenizer for OpenAI LLMs.

Instruct-tune LLaMA on consumer hardware with shareGPT data

1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition

AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more

OpsPilot是WeOps团队开源的一个基于深度学习与LLM技术的智能运维助理,以运维大脑的形态,联动各类运维系统,提供智能化运维的能力支撑。

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.

Implement OpenAI APIs and plugin-enabled ChatGPT with open source LLM and other models.

llm-local

LLM App templates for RAG, knowledge mining, and stream analytics. Ready to run with Docker,⚡in sync with your data sources.

llamacpp

  • janhq/jan : ✩18811 ✎TypeScript ⏱2024-05-25 00:00:00

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)

A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

Chat with your favourite LLaMA models in a native macOS app

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

An innovative library for efficient LLM inference via low-bit quantization

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

AI-powered cybersecurity chatbot designed to provide helpful and accurate answers to your cybersecurity-related queries and also do code analysis and scan analysis.

Inference Vision Transformer (ViT) in plain C/C++ with ggml

AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.

A suite of custom nodes for ConfyUI that includes GPT text-prompt generation, LoadVideo, SaveVideo, LoadFramesFromFolder and FrameInterpolator

A Discord Bot for chatting with LLaMA, Vicuna, Alpaca, MPT, or any other Large Language Model (LLM) supported by text-generation-webui or llama.cpp.

Obsidian Local LLM is a plugin for Obsidian that provides access to a powerful neural network, allowing users to generate text in a wide range of styles and formats using a local LLM.

An OpenAI-like LLaMA inference API

LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.

VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.

LLM telegram bot

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

Erudito: Easy API/CLI to ask questions about your documentation

An AI-app that allows you to upload a PDF and ask questions about it. It uses StableVicuna 13B and runs locally.

Stateful load balancer custom-tailored for llama.cpp

An AI chatbot for Signal powered by Google Bard, Bing Chat, ChatGPT, HuggingChat, and llama.cpp

LocalAGI:Locally run AGI powered by LLaMA, ChatGLM and more. | 基于 ChatGLM, LLaMA 大模型的本地运行的 AGI

Completion After Prompt Probability. Make your LLM make a choice

openai chatgpt or local llm(llama.cpp gguf format)+TTS+STT+Word+Excel

📚 Local PDF-Integrated Chat Bot: Secure Conversations and Document Assistance with LLM-Powered Privacy

Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER)

Python bindings for llama.cpp

ezlocalai is an easy to set up local artificial intelligence server with OpenAI Style Endpoints.

LocalChat is a ChatGPT-like chat that runs on your computer

.NET wrapper for LLaMA.cpp for LLaMA language model inference on CPU. 🦙

LLM in Godot

ollama

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。

🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT chat application.

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.

Private & local AI personal knowledge management app.

Helm chart for Ollama on Kubernetes

🚀 基于 LLM 大语言模型的知识库问答系统。开箱即用,支持快速嵌入到第三方业务系统,1Panel 官方出品。

Java version of LangChain

Artificial Intelligence Infrastructure-as-Code Generator.

All-in-one AI-Powered CLI Chat & Copilot that integrates 20+ AI platforms, including OpenAI, Azure-OpenAI, Gemini, Claude, Mistral, Cohere, VertexAI, Bedrock, Ollama, Ernie, Qianwen, Deepseek…

Small self-contained pure-Go web server with Lua, Teal, Markdown, Ollama, HTTP/2, QUIC, Redis and PostgreSQL support

Ollama Python library

AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;指令协同SD画图。

✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.

The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but completely free and 100% private.

Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama.

Ollama JavaScript library

🔍 AI search engine - self-host with local or cloud LLMs

👾 Automated README file generator, powered by large language model APIs.

A RAG LLM co-pilot for browsing the web, powered by local LLMs

VirtualWife是一个虚拟数字人项目,支持B站直播,支持openai、ollama

The TypeScript library for building AI applications.

A macOS app for interacting with the Ollama models

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

Chat with your documents using local AI

Fractal Graph Desktop for Ai-Agents, Web-Browsing, Note-Taking, and Code.

An open-source alternative to GitHub copilot that runs locally.

A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

Open source Python SDK for agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI, Langchain, and Autogen

Open source guides/codes for mastering deep learning to deploying deep learning in production in PyTorch, Python, Apptainer, and more.

a text-based terminal client for Ollama

Use your locally running AI models to assist you in your web browsing

JS/TS library to make to easy to build with LLMs. Full support for various LLMs and VectorDBs, Agents, Function Calling, Chain-of-Thought, RAG, Semantic Router and more. Based on the popular Stanford DSP paper. Create and compose efficient prompts using prompt signatures. 🌵 🦙 🔥 ❤️ 🖖🏼

Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.

chat web app for teams, sass with user management and ratelimit, support chatgpt(openai & azure), claude, gemini and ollama model

A personal wearable AI that runs locally

  • run-ai/genv : ✩432 ✎Python ⏱2024-05-16 00:00:00

GPU environment and cluster management with LLM support

Fully-featured, beautiful web interface for Ollama LLMs - built with NextJS. Deploy with a single click.

From anywhere you can type, query and stream the output of an LLM or any other script

A lightweight library that leverages Language Models (LLMs) to enable natural language interactions, allowing you to source and converse with data.

Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive data leaving your network.

Local first semantic code search and chat powered by vector embeddings and LLMs

A Web Interface for chatting with your local LLMs via the ollama API

Mac compatible Ollama Voice

RestAI is an AIaaS (AI as a Service) open-source platform. Built on top of LlamaIndex, Ollama and HF Pipelines. Supports any public LLM supported by LlamaIndex and any local LLM suported by Ollama. Precise embeddings usage and tuning.

Text analytics for LLM apps. PostHog for prompts. Extract evaluations, intents and events from text messages. phospho leverages LLM (OpenAI, MistralAI, Ollama, etc.)

An Obsidian plugin to interact with your privacy focused AI-Assistant making your second brain even smarter!

Ollama API bindings for .NET

  • joone/loz : ✩247 ✎TypeScript ⏱2024-03-22 00:00:00

Loz is a command-line tool that enables your preferred LLM to execute system commands and utilize Unix pipes, integrating AI capabilities with other Unix tools.

🔥 TUI interface for LLMs written in Rust

Open source chat kit engineered for seamless interaction with AI models.

Welcome to the Ollama Docker Compose Setup! This project simplifies the deployment of Ollama using Docker Compose, making it easy to run Ollama with all its dependencies in a containerized environment

Generate and brainstorm ideas while creating your notes using Large Language Models (LLMs) from Ollama, LM Studio, Anthropic, OpenAI, Mistral AI, and more for Obsidian.

🦙 Ollama interfaces for Neovim

Chrome Extension to Summarize or Chat with Web Pages/Local Documents Using locally running LLMs. Keep all of your data and conversations private. 🔐

Minimalistic UI for Ollama LMs - This powerful react interface for LLMs drastically improves the chatbot experience and works offline.

A multi-platform desktop application to evaluate and compare LLM models, written in Rust and React.

open-source browser extension that leverages the power of the AI to generate engaging replies for social media growth.

User Interface made for Ollama.ai using Swift

llmcord.py • Talk to LLMs with your friends!

Discord AI chatbot using Ollama

A sleek desktop music player and tagger for offline music 🪕 With experimental features like map view, GPT analysis, artist toolkit. Built with Svelte and Tauri

Build your own ChatPDF and run them locally

🦙 Ollama Telegram bot, with advanced configuration

An LLM agnostic desktop and mobile client.

The easiest way to run the fastest MLX-based LLMs locally

Easyreadme helps you simplify README creation and generate visually stunning ones with the help of AI and elegant pre-designed templates.

Logseq plugin to integerate with ollama

Chat with Ollama/Huggingface/TextGen/OpenAI/Gemini/Anthropic GPT Neovim Plugin: Effortless Natural Language Generation with LLM API

Ollama client for Swift

LLM-powered code documentation generation

Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS.

A LLDB plugin which brings LLMs to LLDB

Ollama conversation integration for Home Assistant

Java library for interacting with Ollama server.

Booster - open platform for serving LLM models

Samples showing how to build Java applications powered by Generative AI and LLMs using Spring AI and Spring Boot.

A NodeJS RAG framework to easily work with LLMs and embeddings

Running local Language Learning Models to perform Retrieval-Augmented Generation

RAG-GPT, leveraging LLM and RAG technology, learns from user-customized knowledge bases to provide contextually relevant answers for a wide range of queries, ensuring rapid and accurate information retrieval.

A Ruby gem for interacting with Ollama’s API that allows you to run open source AI LLMs (Large Language Models) locally.

✨ A Copilot Chat experience in Neovim. Supports Anthropic, Ollama and OpenAI

Run LLMs locally with as little friction as possible.

Chat with multiple PDFs locally

This code implements a Local LLM Selector from the list of Local Installed Ollama LLMs for your specific user Query

Run an AI-powered Discord bot from the comfort of your laptop.

Jupyter Notebooks for Ollama integration

Agents-Flex is an elegant LLM Application Framework like LangChain with Java.

turnkey self-hosted offline transcription and diarization service with llm summary

LLMX; Easiest 3rd party Local LLM UI for the web!

A bot that accepts PDF docs and lets you ask questions on it.

무료로 한국어🇰🇷 파인튜닝 모델 받아서 로컬 LLM 호스팅. LangServe, Ollama, streamlit + RAG

bott: Your Terminal Copilot

Summarize webpages from specified URLs using the LangChain framework and the ChatOllama model

The library for character-driven AI experiences.

A simple experiment on letting two local LLM have a conversation about anything!

Swift powered native macOS client for Ollama, ChatGPT and compatible API-backends

基于 Java Web 项目的 SpringBoot 框架初始化模板,该模板整合了常用的框架(Mybatis-Plus、ShardingSphere、Redis、RabbitMQ、Elasticsearch、SaToken、OSS、Caffeine以及MongoDB等),同时接入了Spring AI,该模板适用于前后端分离项目启动开发,保证大家在此基础上能够快速开发自己的项目,同时也适合入门学习,本项目会由作者持续更新。

Getting started with Ollama for Python - a short tutorial for setting up Ollama for Python

Ollama Cloud is a Highly Scalable Cloud-native Stack for Ollama

JavaScript implementation of LiteLLM.

Ruby Implementation of Nano Bots: small, AI-powered bots that can be easily shared as a single file, designed to support multiple providers such as Cohere Command, Google Gemini, Maritaca AI MariTalk, Mistral AI, Ollama, OpenAI ChatGPT, and others, with support for calling tools (functions).

Starter examples for using Next.js and the Vercel AI SDK with Ollama and ModelFusion.

Chat effortlessly, execute commands, and interpret code with Llama3, Phi3, and more - your local AI assistant. Enjoy seamless interaction while ensuring ultimate privacy

A simple Web / UI / App / Frontend to Ollama.

Streamline Coding & Speed Up Dev Process. Your Own Personal Senior Engineer For Free!

Create your own GPT intelligent assistants using Azure OpenAI, Ollama, and local models, build and manage local knowledge bases, and expand your horizons with AI search engines. 使用 Azure OpenAI、Ollama以及本地模型创建自己的 GPT 智能助手,搭建和管理本地知识库,并使用 AI 搜索引擎扩展你的视野。

Open source alternative to Perplexity AI with ability to run locally

A Reactive CLI that generates git commit messages with Ollama, ChatGPT, Gemini, Claude, Mistral and other AI

Implementing Ollama and Agents to create a blogging bot

Go module for fetching embeddings from embeddings providers

Odin Runes, a java-based GPT client, liberates you from vendor lock-in, allowing seamless interaction with your preferred GPT model right through your favorite text editor. There is more: It also facilitates prompt-engineering by extracting context from diverse sources using technologies such as OCR, enhancing overall productivity and saving costs.

A multi-modal chat application enabling users to create custom agents, and integrate with local LLMs (Local Language Models), as well as OpenAI models.

A simple to use Ollama autocompletion engine with options exposed and streaming functionality

ScrapeGPT is a RAG-based Telegram bot designed to scrape and analyze websites, then answer questions based on the scraped content. The bot utilizes Retrieval Augmented Generation and webscraping to return natural language answers to the user’s queries.

“Just hoof it!” - A spotlight like interface to Ollama

比简单更简单,通过 Ollama 不需要显卡轻松在你的电脑上运行 LLM。

FreeGenius AI, an advanced AI assistant that can talk and take multi-step actions. Supports numerous open-source LLMs via Llama.cpp or Ollama or Groq Cloud API, with optional integration with AutoGen agents, OpenAI API, Google Gemini Pro and unlimited plugins.

A terminal utility for intelligent shell command generation

A nifty little library for working with Ollama in Elixir.

Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫

LLM Benchmark for Throughput via Ollama (Local LLMs)

Talk to your Obsidian notes!

Some experiments around .NET Aspire to learn.

QA-Pilot is an interactive chat project that leverages online/local LLM for rapid understanding and navigation of GitHub code repository or compressed file resource(e.g. xz, zip).

100% free, local & offline voice assistant with speech recognition

🎰Remix Antd Admin is a website building solution based on Remix、Antd、Prisma. (WIP)

🤖📝 A markdown editor powered by AI (Ollama)

Shell wrapper for OpenAI’s ChatGPT, DALL-E, Whisper, and TTS. Features LocalAI, Ollama, Gemini, and Mistral integration.

run ollama & gguf easily with a single command

A Ollama client for Android!

LLM-Bash - A Bash Wrapper around llm and Ollama for your code editor.

A nix flake for https://github.com/ollama/ollama

An Ollama client made with GTK4 and Adwaita

ClicShopping AI(tm) is an Ecommerce OpenSource and powerfull Solution e-commerce B2B / B2C / B2B-B2C using generative-AI and entiment Analysis. The solution is based on a modern, responsive design that will allow you to have a great website on all mobile device, desktop and easy to install.

Tutorials for PandasAI

Deploy a RESTful API Server to interact with Ollama and Stable Diffusion

  • ad-si/cai : ✩35 ✎Rust ⏱2024-04-30 00:00:00

The fastest CLI tool for prompting LLMs. Including support for prompting several LLMs at once!

Deploy your very own ChatGPT-Style Web Interface for Ollama 🦙

Ollero is a nvim plugin for integrating ollama with neovim locally

Documentation on setting up an LLM server on Debian from scratch, using Ollama, Open WebUI, and OpenedAI Speech.

AI Agent capable of automating various tasks using openai function call feature

使用Ollama Rest 接口为Bob翻译

This is an encrypted Angular 17 PWA messenger application with an AI friend that can work offline and has a reactive Spring Boot 3 backend with a MongoDB to store the messages in a clean architecture. With a Kubernetes setup. Structurizr is used for C4 diagrams.

CrewAI agents that gather and analyze YouTube comments to generate insights to inform better content creation.

Some Cool Semantic Kernel Plugins

Unleash the power of FOSS language models on your local machine

AI-augmented, conversational information retrieval and data exploration

Use google sheets as a gui for crewAI

aitelegrambot is a telegram bot which uses Ollama as its backend.

Boost productivity and power your workflow with Nano Bots for Sublime Text: small, AI-powered bots that can be easily shared as a single file, designed to support multiple providers such as Cohere Command, Google Gemini, Maritaca AI MariTalk, Mistral AI, Ollama, OpenAI ChatGPT, and others, with support for calling tools (functions).

AI Chatbot that helps you learn how to use Dagger

Langchain Models for RAGs and Agents

Build low-code, advanced AI applications and workflows using any OpenAI API-compatible framework.

Streamlit UI for Ollama that has support for vision and chat models

VT.ai - Multimodal AI Chatbot

Chat with your Documents(PDF, TXT, DOCX, ODT, PPTX etc), Websites and Youtube Chat too!, CSV files. Uses langchain, Ollama, Groq, Gemini, Streamlit. Llama3 available

Ollama Shell Helper (osh) : English to Unix-like Shell Commands translation using Local LLMs with Ollama

A simple, lightweight shell script to interact with OpenAI or Ollama or Mistral AI from the terminal, and enhancing intelligent system management without any dependencies(pure shell).

A Pull Request summarizer powered by Ollama and Llama2

Infinite Craft but in Pyside6 and Python with local LLM (llama2 & others) using Ollama

Designed for offline use, this RAG application template is based on Andrej Baranovskij’s tutorials. It offers a starting point for building your own local RAG pipeline, independent of online APIs and cloud-based LLM services like OpenAI.

Examples of RAG using LangChain with local LLMs - Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B

Ollama app for macos

Ask Ollama about your local text files

AI Commits with ollama VS Extension.

Open spec for Nano Bots: small, AI-powered bots that can be easily shared as a single file, designed to support multiple providers such as Cohere Command, Google Gemini, Maritaca AI MariTalk, Mistral AI, Ollama, OpenAI ChatGPT, and others, with support for calling tools (functions).

Repository for the workshop “Unlocking the power of AI: Private conversations with your docs using Langchain JS” by @pattyneta

VSCode coding companion for software teams 🦆 Turn your team insights into a portable plug-and-play context for code generation. Alternative to GitHub Copilot & OpenAI GPT powered by OSS LLMs (Phi 3, Llama 3, CodeQwen, Mistral, etc.), made with ❤️ using FastAPI & Ollama.

A minimal web-UI for talking to Ollama servers

AI 应用示例合集

Neovim Plugin to interact with Ollama API 🦙

A Ghidra script that enables the analysis of selected functions and instructions using locally-hosted Language Model Models (LLMs). It aims to make reverse-engineering more efficient by using Ollama’s API directly within Ghidra.

Go vs Rust AI bot voice conversation

Run multiple open source large language models concurrently powered by Ollama

Web UI for working with large language models

Benchmark your local LLMs.

Mindjet Mindmanager automation and OpenAI / LLM Integration

Interact with an LLM service

ExcelChat - Chat w/ your excel file

GenKit is a library that abstracts away all the differences across generative AI platforms.

Watch and hear endless conversations between two ollamas, hence the Two-Way Conversation Engine (TWICE)

parrot.nvim 🦜 - the plugin that brings stochastic parrots to Neovim.

Open Source Component-Driven Code Generator; It supports OpenAI, Ollama, ChakraUI, MaterialUI, BuilderIO, and more.

Unified API for LLM at Scale

Build your personal AI with the llama2 model 🤖. Easy to use, and perfect for your development needs!

Speak (speech-to-text) to Ollama LLMs in any lanaguage - Streamlit app

Precis is an AI-enabled RSS reader

A UI for Ollama on Mac

Advanced AI email assistant using Groq for responsive replies, Llama for contextual information retrieval, and RAG with LangChain for enhanced accuracy.

A project to show howto use SpringAI with OpenAI to chat with the documents in a library. Documents are stored in a normal/vector database. The AI is used to create embeddings from documents that are stored in the vector database. The vector database is used to query for the nearest document. That document is used by the AI to generate the answer.

Local AI pair programming tool

A simple, intuitive toolkit for quickly implementing LLM powered applications.

Practical step-by-step LangChain guides

A simple RAG chatbot that can retrieve from a mediawiki data dump

A minimal Chatbot GUI for Ollama models - Streamlit App

This library exposes PAPA, your Personal Assistant powered by Private AI, which can be used in any browser environment and completely offline

Tool to manage ollama model on vast.ai

Create an AI-powered meeting summary tool with Python, Ollama, and Gemma

A discord chat bot utilizing AI in a fun and whimsical way. Provides some moderation tools as well.

AgentX is an Open-source library that help people use LLMs on their own computers or help them to serve LLMs as easy as possible that support multi-backends like PyTorch, llama.cpp, Ollama and EasyDeL

HTTP API for Nano Bots: small, AI-powered bots that can be easily shared as a single file, designed to support multiple providers such as Cohere Command, Google Gemini, Maritaca AI MariTalk, Mistral AI, Ollama, OpenAI ChatGPT, and others, with support for calling tools (functions).

Nano Bots for Obsidian: small, AI-powered bots that can be easily shared as a single file, designed to support multiple providers such as Cohere Command, Google Gemini, Maritaca AI, Mistral AI, Ollama, OpenAI ChatGPT, and others, + function calling.

Llava, Ollama and Streamlit | Create POWERFUL Image Analyzer Chatbot for FREE - Windows & Mac

A collection of ready to use ollama models

A Telegram bot that summarizes messages from a chat using AI.

Run ollama natively - powered by Nix

Gollama: Your offline conversational AI companion. An interactive tool for generating creative responses from various models, right in your terminal. Ideal for brainstorming, creative writing, or seeking inspiration.

Open Source bro of my conversational agent. https://github.com/mfmezger/conversational-agent-langchain

This application serves as a demonstration of the integration of langchain.js, Ollama, and ChromaDB to showcase question-answering capabilities. With a focus on Retrieval Augmented Generation (RAG), this app enables shows you how to build context-aware QA systems with the latest information.

MVP of an idea using multiple LLM models to simulate and play D&D (Local LLM via ollama support + together.ai API support)

Telegram bot to interact with ollama models

A nats micro service interacting with Ollama

Agentis is an application interface for your local AI models with Ollama allowing you to speak with text and voice with your LLM.