• Chaos Theory
  • Posts
  • 🥟 Chao-Down #275 Google's Gemini pauses image generation of people after reported racial bias, GenZ seeks out ChatGPT for job advice, Microsoft turns to Intel not NVIDIA to build a custom chip

🥟 Chao-Down #275 Google's Gemini pauses image generation of people after reported racial bias, GenZ seeks out ChatGPT for job advice, Microsoft turns to Intel not NVIDIA to build a custom chip

Plus, how AI hiring tools may be filtering out the best applicants.

In the latest rankings for the best community-voted large language model, GPT4 still is on top with Gemini Pro right behind.

The Large Model Systems Organization (LMSys) is a research group formed by UC Berkeley, UC San Diego, and Carnegie Mellon University. They launched Chatbot Arena in May to rank large language models using crowdsourced blind tests. Users provide prompts, compare responses from two anonymous models, and choose the better one.

The Elo rating system, commonly used in chess and other competitive games, determines the rankings. Since December, LMSys has collected more than 130,000 blind ratings for 45 different models.

OpenAI's GPT-4 models have led the rankings since their release about a year ago. However, Google's Gemini Pro (formerly Bard) and Mistral-Medium from Mistral AI in Paris have risen in the rankings lately.

It’s a space that’ll continue to get a lot of attention as developers and organizations seek to crown the best of AI.

-Alex, your resident Chaos Coordinator.

What happened in AI? 📰

Gen Z workers think their employers don’t care about their career growth, so they’re turning to ChatGPT for job advice (Fortune)

When A.I. Can Make a Movie, What Does “Video” Even Mean? (The New Yorker)

Google to Pause Gemini Image Generation of People After Issues (Bloomberg)

Google’s Gemini AI is comically woke : r/MSsEcReTPoDcAsT

Biden Deepfake and Other Audio Fakes Were Made With ElevenLabs AI (Bloomberg)

AI hiring tools may be filtering out the best job applicants (BBC)

Microsoft turns to Intel, not Nvidia, to make new chip (qz.com)

Always be Learnin’ 📕 📖

Scaling ChatGPT: Five Real-World Engineering Challenges (pragmaticengineer.com)

Why "Chat over Your Data" Is Harder Than You Think (Arcus)

A beginner’s guide to making beautiful slides for your talks (ines.io)

Projects to Keep an Eye On 🛠

FujiwaraChoki/MoneyPrinterV2: Automate the process of making money online. (Github)

lobehub/lobe-chat: 🤖 Lobe Chat - an open-source, high-performance AI Chat framework. Support one-click free deployment of your private ChatGPT/Gemini/Local LLM application. (Github)

V-JEPA: The next step toward advanced machine intelligence (Meta)

The Latest in AI Research 💡

A Critical Evaluation of AI Feedback for Aligning Large Language Models (arxiv)

Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning (arxiv)

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset (arxiv.org)

The World Outside of AI 🌎

New FDA-approved drug makes severe food allergies less life-threatening (Ars Technica)

'Soaring' over hills or 'playing' with puppies, study finds seniors enjoy virtual reality | (AP News)

Commercial Real Estate Market Plunge Has Lenders Facing a Brutal Reality - (Bloomberg)

How Love and Romance Affect Your Brain (The New York Times)

TikTok influencers are providing a second life for debunked health claims (Vox)

The Quest for a DNA Data Drive (IEEE Spectrum)

One Last Bite 😋