A record of how I started picking up LLMs.

These notes are written in Obsidian, whose exclusive link formats make it inconvenient to update them here. I will update everything at once when I finish and export an HTML or PDF version.
- Architecture
- simple version & Basic idea
- overview
- Encoder, Decoder
- Attention
- Coding: Attention
- comprehensive version
- Encoder-only: BERT
- architecture
- pre-training
- Decoder-only: GPT
        - Brief history: what's new, what are the differences
- pre-training
            - tokenizer, embedding, transformer block, lm_head
- Encoder-Decoder
    - Comparison
- bert vs. GPT
- decoder-only vs. encoder-decoder
- Coding <- I'm here; I've been preparing for an exam recently, will catch up soon
- Encoder-only: BERT
- simple version & Basic idea
- Training process
- Overview
- [ every step ]
- Use LLM
    - Hands-on practice 1
    - Hands-on practice 2
- Research
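As a starting point for the "Coding: Attention" item above, here is a minimal sketch of scaled dot-product attention in NumPy. The function name and toy shapes are my own choices, not from any particular library; it only illustrates the core formula softmax(QK^T / sqrt(d_k)) V.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Minimal single-head attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                  # (n_q, n_k) similarity scores
    # numerically stable row-wise softmax
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V                               # weighted sum of value vectors

# toy example: 3 tokens, dimension 4 (shapes chosen arbitrarily for illustration)
rng = np.random.default_rng(0)
Q = rng.standard_normal((3, 4))
K = rng.standard_normal((3, 4))
V = rng.standard_normal((3, 4))
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (3, 4)
```

Multi-head attention, masking, and the learned projection matrices would go on top of this, but the softmax-weighted sum here is the piece everything else is built around.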