🚓 NVIDIA raided in France 🚓

In today’s edition (29 Sept):

📰 Quick News - NVIDIA raided in France and more

📚 Nerd section - How to run Llama 2 70B on consumer GPUs like NVIDIA RTX 3090 or 4090 and more

📰 Quick News 📰

🚓 Nvidia offices raided by French competition authority: amid soaring demand and record revenues, Nvidia's prominence in the chip industry is attracting regulators' attention. The French raid, which probes potential anti-competitive practices, reflects heightened vigilance from governments worldwide.

🎙️ Spotify is adding auto-generated transcripts to millions of podcasts: the time-synced text accompanies each episode, making content more accessible and skimmable. The move follows Spotify's recent AI-driven voice cloning tool for translations and is meant to bolster user experience and engagement.

💸 Chinese chipmaker Enflame raises $274M: amid a national push for semiconductor self-reliance, Enflame Technology has secured significant funding, led by Shanghai International Group and backed by giants like Tencent. As US sanctions expand, China is advancing in chip manufacturing, with even Huawei showcasing gains despite import bans. The shift has global companies like Nvidia adjusting their strategies for the Chinese market.

🛂 UK pushes for greater access to AI’s inner workings to assess risks: the UK emphasizes the need for transparency in AI to understand its potential risks, proposing open-access AI models and explainable AI systems while promoting collaboration among global stakeholders to strike a balance between innovation and accountability.

🔒 Your website can now opt out of training Google's Bard and future AIs: granting webmasters the choice to prevent their content from feeding Google's AI training, amid discussions on consent and ethical data collection practices.

📚 Nerd section 📚

🛠️ Discover how ExLlamaV2's mixed-precision quantization makes it possible to run Llama 2 70B on consumer GPUs with limited VRAM, such as the NVIDIA RTX 3090 or 4090, by targeting an average precision below 3 bits per weight.
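
Why "below 3-bit"? A rough back-of-the-envelope sketch helps. It assumes a rounded 70 billion parameters and the 24 GiB of VRAM found on an RTX 3090 or 4090, and it counts the weights only (no KV cache, activations, or framework overhead), so real headroom is tighter than these numbers suggest. The function name is ours, not part of ExLlamaV2.

```python
def weight_memory_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory needed for the model weights alone, in GiB."""
    return n_params * bits_per_weight / 8 / 2**30


N_PARAMS = 70e9  # assumption: nominal, rounded parameter count for Llama 2 70B
VRAM_GIB = 24    # VRAM on an RTX 3090 or RTX 4090

# Compare 16-bit weights with a few average quantization levels.
for bpw in (16, 4, 3, 2.5):
    gib = weight_memory_gib(N_PARAMS, bpw)
    verdict = "fits" if gib < VRAM_GIB else "does not fit"
    print(f"{bpw:>4} bits/weight: {gib:6.1f} GiB -> {verdict} in {VRAM_GIB} GiB")
```

Under these assumptions, an average of exactly 3 bits per weight already lands slightly above 24 GiB, which is why the target has to sit below that mark.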

📎 Microsoft and MIT researchers hope to reduce AI hallucinations with DoLa: a novel decoding approach, DoLa (Decoding by Contrasting Layers) prioritizes what the deeper layers of an LLM predict over what its earlier layers predict, improving factual output and reasoning without external knowledge retrieval. It shows better results in evaluations judged by GPT-4; however, it hasn't been tested across all domains and relies solely on the model's existing knowledge.

🎨 RealFill, similar in spirit to Adobe Firefly, uses reference images to build a personalized inpainting model, producing image completions that stay faithful to the original scene even under varying conditions and outperforming baseline methods in fidelity and visual quality.

Thank you for reading today’s edition!

We'd love to hear from you!

Feel free to reply to our emails with questions, suggestions, or topics you'd like to see covered, or drop us a message on Twitter or Facebook.

Until tomorrow,
- Tsvetelin (Bits and Neurons)