CoinWorld news: Tether recently demonstrated its newly launched QVAC system running the Llama 3.2 model (1 billion parameters) on a mobile device via llama.cpp, achieving efficient local inference. QVAC is a general-purpose inference and fine-tuning runtime designed to run across a range of end-user devices, including smartphones, laptops, and servers. It currently supports multiple models, with support for more planned. [Wu said]