Latest Open Source Projects

browser-use

19,800

TLDR: The browser-use repository provides an easy way to connect AI agents with the browser. It offers vision and HTML extraction, multi-tab management, custom actions, and parallel agents. It also collects anonymous usage data to guide improvements.

ai-agents ai-tools browser-automation llm python Python

2024-10-31 GitHub

open-computer-use

594

@e2b-dev

TLDR: A secure cloud Linux computer powered by E2B Desktop Sandbox and controlled by open-source LLMs. It supports various LLMs such as Meta Llama and OS-Atlas, and operates via keyboard, mouse, and shell commands. New LLMs that adhere to the OpenAI API specification are easy to add.

agent ai anthropic claude computer-use llm Python

2024-10-31 GitHub

text-extract-api

2,100

@CatchTheTornado

TLDR: A tool for converting images, PDFs, and Office documents to Markdown or JSON with high accuracy. It is built with FastAPI and uses Celery for asynchronous tasks and Redis for caching. It supports various OCR strategies, can remove PII, ships with a CLI tool, and offers different storage strategies. An online demo and dedicated API clients are also available.

anonymization api extract json llm ocr ocr-python pdf pii Python

2024-10-23 GitHub

FlagEmbedding

8,300

@FlagOpen

TLDR: FlagEmbedding focuses on retrieval-augmented LLMs and consists of multiple projects covering inference, fine-tuning, evaluation, datasets, tutorials, and research. It offers various embedding and reranker models for different languages and tasks.

embeddings information-retrieval llm retrieval-augmented-generation sentence-embeddings text-semantic-similarity Python

2023-08-02 GitHub

llama_index

38,300

@run-llama

TLDR: LlamaIndex is a data framework for LLM applications. It provides data connectors, ways to structure data, an advanced retrieval/query interface, and easy integrations. It offers starter and customized options in Python, along with an ecosystem that includes LlamaHub and LlamaLab. Contributions are welcome and full documentation is available.

agents application data fine-tuning framework llamaindex llm multi-agents rag vector-database Python

2022-11-02 GitHub

khoj

25,400

@khoj-ai

TLDR:

agent ai assistant chat chatgpt emacs image-generation llama3 llamacpp llm obsidian obsidian-md offline-llm productivity rag research self-hosted semantic-search stt whatsapp-ai Python

2021-08-16 GitHub