Originally posted by qarium
View Post
And for CPU inference it defintely uses 4-bit.
Comment