firecrawl

TLDR: Firecrawl is an API service that crawls URLs and converts their content into clean markdown or structured data. It supports scraping, crawling, and data extraction, with LLM-ready output formats and customizable behavior. SDKs are available for several languages, along with integrations for multiple frameworks.

2024-04-15 GitHub

llama_index

TLDR: LlamaIndex is a data framework for LLM applications. It provides data connectors, ways to structure data, an advanced retrieval/query interface, and easy integrations. It offers starter and customized options in Python, and its README links to key resources and describes an ecosystem that includes LlamaHub and LlamaLab. Contributions are welcome, and full documentation is available.

2022-11-02 GitHub