Profile Picture
  • All
  • Search
  • Images
  • Videos
    • Shorts
  • Maps
  • News
  • More
    • Shopping
    • Flights
    • Travel
  • Notebook
Report an inappropriate content
Please select one of the options below.
NVIDIA
H100
A100 GPU
H100
NVIDIA A100
超威 H100gpu
NVIDIA
H100 拆解
NVIDIA
H100 Repair
GPU
H200
H100
GPU
NVIDIA
H800 vs H100
A100
NVIDIA A100
Price
H100 白皮书
H100 GPU
Tear Down
NVIDIA A100
80GB Gaming
英伟达 H100 发布会
NVIDIA
H100 PCIe 80GB 官网报价
A100
Server
NVIDIA H100 GPU
Config
NVIDIA
H100 Tear Down
NVIDIA Tesla A100
80GB PCI-E Video
NVIDIA A100
Rack
Grok 3
Hg100ma
NVIDIA
Tesla H100 80GB
Ai100
Lm Studio
NVIDIA A100
GPU
H100
NVIDIA
H200 PCIe 卡论坛
NVIDIA
H100 Price
  • Length
    AllShort (less than 5 minutes)Medium (5-20 minutes)Long (more than 20 minutes)
  • Date
    AllPast 24 hoursPast weekPast monthPast year
  • Resolution
    AllLower than 360p360p or higher480p or higher720p or higher1080p or higher
  • Source
    All
    Dailymotion
    Vimeo
    Metacafe
    Hulu
    VEVO
    Myspace
    MTV
    CBS
    Fox
    CNN
    MSN
  • Price
    AllFreePaid
  • Clear filters
  • SafeSearch:
  • Moderate
    StrictModerate (default)Off
Filter
    NVIDIA
    H100
    A100 GPU
    H100
    NVIDIA A100
    超威 H100gpu
    NVIDIA
    H100 拆解
    NVIDIA
    H100 Repair
    GPU
    H200
    H100
    GPU
    NVIDIA
    H800 vs H100
    A100
    NVIDIA A100
    Price
    H100 白皮书
    H100 GPU
    Tear Down
    NVIDIA A100
    80GB Gaming
    英伟达 H100 发布会
    NVIDIA
    H100 PCIe 80GB 官网报价
    A100
    Server
    NVIDIA H100 GPU
    Config
    NVIDIA
    H100 Tear Down
    NVIDIA Tesla A100
    80GB PCI-E Video
    NVIDIA A100
    Rack
    Grok 3
    Hg100ma
    NVIDIA
    Tesla H100 80GB
    Ai100
    Lm Studio
    NVIDIA A100
    GPU
    H100
    NVIDIA
    H200 PCIe 卡论坛
    NVIDIA
    H100 Price
You now convert any LLM into a faster one without retraining from scratch.NVIDIA just did this to their 30B model. Here's the trick:1. Duplicate the model into two copies2. Freeze one copy, it just reads the prompt and remembers context3. Train the other copy to write chunks of text at once instead of one word at a time4. Run them togetherThe frozen copy barely costs anything (it's already trained). The new copy only needed ~8% of the original training data to learn the new trick.Result: 2.4x fa
0:13
You now convert any LLM into a faster one without retraining from scratch.NVIDIA just did this to their 30B model. Here's the trick:1. Duplicate the model into two copies2. Freeze one copy, it just reads the prompt and remembers context3. Train the other copy to write chunks of text at once instead of one word at a time4. Run them togetherThe frozen copy barely costs anything (it's already trained). The new copy only needed ~8% of the original training data to learn the new trick.Result: 2.4x fa
103.4K views1 day ago
x.comLior Alexander
See more
Static thumbnail place holder
More like this
  • Privacy
  • Terms