chtnnh's Digital Garden

              • aiSafety
              • compressingQuantizedLanguageModels
            • 01_machine_learning_and_deep_learning
            • 02_large_language_models
            • 03_vision_language_models
            • 04_ai_agents
            • 05_advanced_topics
            • 06_aws_for_ai_engineering
            • 07_terraform_for_ai_infrastructure
            • 08_open_source_projects_and_communities
            • training-and-evaluation
          • roadmap
          • quickRefresher
          • list
          • todo
              • mechInterp
              • Proof for the Macroscopic Effects of Quantum Events via the Butterfly Effect and Chaos Theory
              • frauchigerRennerDeepDive
            • shouldYouStartAStartup
          • antlerMENAP
            • README
            • README
            • README
            • README
            • README
            • README
            • README
            • README
          • CONTRIBUTING
          • LICENSE
          • README
        • Graph View Guide
        • IMPLEMENTATION_SUMMARY
        • Master MOC - Map of Content
        • Projects Tracker
        • README
        • TAGGING_SUMMARY
      Home

      ❯

      learning

      ❯

      ai

      ❯

      notes

      ❯

      02_large_language_models

      02_large_language_models

      May 16, 20251 min read

      Large Language Models (LLMs)

      Introduction

      Large Language Models (LLMs) are powerful AI systems that can understand and generate human-like text. They have revolutionized natural language processing and are the foundation of many modern AI applications.

      Topics Covered

      1. Algorithms

      • Byte-Pair Encoding (BPE)
      • Self-Supervised Learning
      • Supervised Learning (Instruction Tuning)
      • Reinforcement Learning

      2. Architectures

      • Transformer Architecture
      • Encoder-Decoder
      • Causal Decoder
      • Prefix Decoder

      3. Training Techniques

      • Data Collection and Preprocessing
      • Model Configuration
      • Model Training
      • Fine-tuning

      4. Optimization

      • Model Pruning
      • Knowledge Distillation

      5. Tools and Libraries

      • Hugging Face Transformers
      • TensorFlow
      • PyTorch
      • LangChain
      • LlamaIndex

      Learning Resources

      Online Courses

      • LLM University by Cohere
      • Large Language Model Course by Maxime Labonne

      Documentation and Guides

      • Hugging Face Transformers Documentation
      • PyTorch Documentation
      • TensorFlow Documentation
      • LangChain Documentation
      • LlamaIndex Documentation

      Research Papers and Articles

      • Large language model - Wikipedia
      • Large language model training: how three training phases shape LLMs
      • Exploring Architectures and Configurations for Large Language Models

      Graph View

      • Large Language Models (LLMs)
      • Introduction
      • Topics Covered
      • 1. Algorithms
      • 2. Architectures
      • 3. Training Techniques
      • 4. Optimization
      • 5. Tools and Libraries
      • Learning Resources
      • Online Courses
      • Documentation and Guides
      • Research Papers and Articles

      darukavana - chtnnh's digital garden | Product Hunt

      • GitHub