Welcome to Incremental Social! Learn more about this project here! Check out lemmyverse to find more communities to join from here!
kiku123 , 25 days ago I agree. My 3070 runs the 8B Llama3 model in about 250ms, especially for short responses.
I agree. My 3070 runs the 8B Llama3 model in about 250ms, especially for short responses.