applied-ai-engineering-samples

applied-ai-engineering-samples

TLDR: This repository contains reference guides, blueprints, code samples, and hands-on labs related to Applied AI Engineering developed by the Google Cloud Applied AI Engineering team. It includes sections on Generative AI on Vertex AI, Google Cloud AI/ML infrastructure, Research Operationalization, and various solutions catalogs.

2023-08-25 Github

FlagEmbedding

FlagEmbedding

TLDR: FlagEmbedding focuses on retrieval-augmented LLMs and consists of multiple projects including inference, finetune, evaluation, dataset, tutorials, and research. It offers various embedding and reranker models for different languages and tasks.

2023-08-02 Github

ms-swift

ms-swift

TLDR: Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).

Python
2023-08-01 Github

LLMs-from-scratch

38,600
@rasbt

LLMs-from-scratch

TLDR: This repository contains code for developing, pretraining, and finetuning a GPT-like LLM. It is the official code repository for the book 'Build a Large Language Model (From Scratch)'. The code is designed to run on conventional laptops and automatically utilizes GPUs if available. It also includes bonus materials and has specific hardware requirements.

2023-07-23 Github

llm-app

llm-app

TLDR: Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

2023-07-19 Github

generative-ai-for-beginners

generative-ai-for-beginners

TLDR: 21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

2023-06-19 Github

llm-course

llm-course

TLDR: This repository contains a comprehensive course on large language models, divided into three parts: LLM Fundamentals, The LLM Scientist, and The LLM Engineer. It includes notebooks and articles related to various aspects of large language models such as fine-tuning, quantization, and building applications.

2023-06-17 Github

gpt-researcher

gpt-researcher

TLDR: GPT Researcher is an autonomous agent for comprehensive web and local research. It produces detailed research reports with citations, addresses issues of misinformation and token limitations in LLMs, and offers features like smart image scraping, long report generation, and multiple source aggregation. It can be installed via various methods and has a multi-agent assistant and enhanced frontend applications.

2023-05-12 Github

swarms

swarms

TLDR: The swarms repository provides an enterprise-grade production-ready multi-agent orchestration framework. It offers a variety of agent architectures and tools for tasks such as financial analysis, healthcare diagnosis, and task routing. The framework is highly customizable and includes features like sequential and parallel processing, long-term memory integration, and multi-modal capabilities.

2023-05-11 Github

aider

aider

TLDR: Aider is an AI pair programming tool that works in your terminal and edits code in local git repositories. It works best with GPT-4o and Claude 3.5 Sonnet and can connect to almost any LLM. It offers features like automatic git commits, works with multiple languages, and can edit multiple files at once. It has top tier performance on SWE Bench and received kind words from users.

2023-05-09 Github