Nvidia’s ‘Nemotron-4 340B’ model redefines synthetic data generation, rivals GPT-4 (venturebeat.com)
Posted by ylai@lemmy.ml to AI@lemmy.ml · English · 5 months ago
Fisch@discuss.tchncs.de · 5 months ago: 340B is fucking huge, holy shit. How big is GPT-4?
ylai@lemmy.ml (OP) · 5 months ago: The rumor is 1.76 trillion, or 8x220B (mixture of experts) to be specific: https://wandb.ai/byyoung3/ml-news/reports/AI-Expert-Speculates-on-GPT-4-Architecture---Vmlldzo0NzA0Nzg4
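For anyone wanting to check the arithmetic behind the rumored figure, here is a minimal sketch. It assumes the naive reading where the total is just the number of experts times the per-expert size; real mixture-of-experts models usually share some layers between experts, so the true count could differ. The function and variable names are made up for illustration, and the GPT-4 numbers are the unconfirmed rumor, not an official spec.

```python
# Rumored (unconfirmed) GPT-4 configuration: 8 experts of 220B parameters each.
NUM_EXPERTS = 8
PARAMS_PER_EXPERT = 220 * 10**9

# Nemotron-4 340B, the model the post is about.
NEMOTRON_PARAMS = 340 * 10**9


def naive_total_params(num_experts: int, params_per_expert: int) -> int:
    """Naive total: assumes the experts share no parameters at all."""
    return num_experts * params_per_expert


gpt4_rumored = naive_total_params(NUM_EXPERTS, PARAMS_PER_EXPERT)
ratio = gpt4_rumored / NEMOTRON_PARAMS

print(f"Rumored GPT-4 total: {gpt4_rumored / 10**12:.2f} trillion parameters")
print(f"That is about {ratio:.1f}x the size of Nemotron-4 340B")
```

So if the rumor holds, 8 x 220B does work out to 1.76 trillion, roughly five times the size of Nemotron-4 340B.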