Purpose

This place where I store whole things I know, learn and inspect about AI, ML and DA to build up these into consistent collections

center

General Knowledge

Landscape

Organization

Page

Artificial Intelligence / Machine Learning

Articles

Awesome Repositories

Blogs

Topic

Youtube Channel

  • NeuralNine : Educational brand focusing on programming, machine learning and computer science
  • sentdex : Funny guy who teach you about build cool stuff with python like AI
  • MLOps.community : The MLOps Community fills the swiftly growing need to share real-world Machine Learning Operations best practices from engineers in the field

Data Analysis

Articles

Awesome Repositories

Blogs

Topic

MLOps

Articles

Awesome Repositories

Blogs

Code Example

Topics

AI/ML/Data/MLOps Tools

Computer Vision

Data Orchestration Workflow

  • airbyte: The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

  • airflow: A platform to programmatically author, schedule, and monitor workflows

Labeling and Annotation

  • Argilla: a collaboration tool for AI engineers and domain experts to build high-quality datasets

LLM UI

  • chainlit: Build Conversational AI in minutes ⚑️
  • streamlit: A faster way to build and share data apps.

LLM

  • langfuse: πŸͺ’ Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets
  • litellm: Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format
  • TaskingAI: A BaaS (Backend as a Service) platform forΒ LLM-based Agent Development and Deployment

Models

  • trl: Train transformer language models with reinforcement learning. Hugging Face

Model Management and Serving

  • KServe: A standardΒ Model Inference PlatformΒ onΒ Kubernetes, built forΒ highly scalableΒ use cases.
  • mlflow: Open source platform for the machine learning lifecycle
  • MLServer: An open source inference server for your machine learning models.
  • ray: an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Streaming / CDC

  • bytewax: Python Stream Processing
  • debezium: Change data capture for a variety of databases

Toolkits

  • openvino: OpenVINOβ„’ is an open-source toolkit for optimizing and deploying AI inference
  • pachyderm: Data-Centric Pipelines and Data Versioning

Training

  • open_flamingo: An open-source framework for training large multimodal models.

VectorDB

  • Chroma: The AI-native open-source vector database (Opensource)
  • Milvus: A high-performance, highly scalable vector database that runs efficiently across a wide range of environments, from a laptop to large-scale distributed systems