Llama 3.1 Megathread

Blaed@lemmy.world · 4 months ago

DreamDrifter@lemmynsfw.com · 4 months ago

Agreed - 8b has enough magic to hold a conversation and do small tasks, such as breaking up a large task or picking out key details. Those outputs can then be fed into other small models (maybe even more narrowly fine-tuned ones).

180b isn’t enough to replace all the other pieces of a system that you need for autonomous action or memory

I think 8b models are enough to make AGI possible if we stack them just right. They’re enough to fill in most of the gaps for building practical things too, and they’re not that far off for everything else.

Official Meta News & Documentation

HuggingFace Download Links

`8B`

`70B`

`405B`

Getting the models

Running the models

`Linux`

`Windows`

`Mac`

`Cloud`

More guides and resources

YouTube media