Purpose

This place where I store whole things I know, learn and inspect about AI, ML and DA to build up these into consistent collections

General Awesome Repositories

Repository

Page

Topic

Organization

Landscape

Artificial intelligence & Machine Learning

Toolkits

  • openvino: OpenVINOβ„’ is an open-source toolkit for optimizing and deploying AI inference
  • ray: an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
  • pachyderm: Data-Centric Pipelines and Data Versioning

VectorDB

  • Milvus: A high-performance, highly scalable vector database that runs efficiently across a wide range of environments, from a laptop to large-scale distributed systems
  • Chroma: The AI-native open-source vector database (Opensource)

LLM

  • private-gpt: Interact with your documents using the power of GPT, 100% privately, no data leaks. Website

Computer Vision

Text To Speech

  • VALL-E-X: An open source implementation of Microsoft’s VALL-E X zero-shot TTS model

Models

  • ColossalAI: Making large AI models cheaper, faster and more accessible
  • trl: Train transformer language models with reinforcement learning. Hugging Face

Blogs

Articles

Youtube Channel

  • NeuralNine : Educational brand focusing on programming, machine learning and computer science
  • sentdex : Funny guy who teach you about build cool stuff with python like AI
  • MLOps.community : The MLOps Community fills the swiftly growing need to share real-world Machine Learning Operations best practices from engineers in the field

Data Analysis

Awesome Repositories

Tools

  • airbyte: The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
  • airflow: A platform to programmatically author, schedule, and monitor workflows
  • active_workflow: Polyglot workflows without leaving the comfort of your technology stack.
  • datahub: The Metadata Platform for your Data Stack

Blogs

Articles