https://huggingface.co/timm/csatv2_21m.sw_r640_in1k (83.13% top-1) https://huggingface.co/timm/csatv2_21m.sw_r512_in1k (82.58% top-1) Factor non-persistent param init ...
Thanks to AWQ, TinyChat can deliver more efficient responses with LLM/VLM chatbots through 4-bit inference. TinyChat with LLaMA-3-8b on RTX 4090 (2.7x faster than FP16): TinyChat with LLaMA-3-8b on ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results