bot@lemmy.smeargle.fansMB to Hacker News@lemmy.smeargle.fans · 2 years agoLLM in a Flash: Efficient LLM Inference with Limited Memoryhuggingface.coexternal-linkmessage-square0linkfedilinkarrow-up13file-textcross-posted to: hcc@lemmy.dbzer0.comhackernews@derp.foo
arrow-up13external-linkLLM in a Flash: Efficient LLM Inference with Limited Memoryhuggingface.cobot@lemmy.smeargle.fansMB to Hacker News@lemmy.smeargle.fans · 2 years agomessage-square0linkfedilinkfile-textcross-posted to: hcc@lemmy.dbzer0.comhackernews@derp.foo