- Chaos Theory
- Posts
- 🥟 Chao-Down #69 StackOverflow charges for access to data, Microsoft and Epic team up to use GPT4 to analyze medical records, Google reorganizes Deepmind and Brain teams
🥟 Chao-Down #69 StackOverflow charges for access to data, Microsoft and Epic team up to use GPT4 to analyze medical records, Google reorganizes Deepmind and Brain teams
Plus, the challenges of deepfake detection
Another company is looking to cash in on providing the training data to large language models.
Following the recent announcement from Reddit and Twitter to increase the prices to access their APIs, StackOverflow, the popular question-answering site for software engineers, is also demanding compensation when its data is used to train AI models and ChatGPT-style bots.
Any corporate large language model that has code-generation capabilities is very likely using training data that has scraped StackOverflow pages. These companies will now need to pay up or face litigation.
Which platform will charge for access to its data next?
-Alex, your resident Chaos Coordinator.
What happened in AI? 📰
Stack Overflow Will Charge AI Giants for Training Data | (WIRED)
GPT-4 will hunt for trends in medical records thanks to Microsoft and Epic | (Ars Technica)
Jira, Confluence to Use OpenAI for New Atlassian Chat, Coding Tools (Bloomberg)
OpenAI’s hunger for data is coming back to bite it | (MIT Technology Review)
Deepfake Detection Is One Corner of AI Tech That Isn’t Booming - (Bloomberg)
Google consolidates AI research labs into Google DeepMind to compete with OpenAI (VentureBeat)
Always be Learnin’ 📕 📖
Replit - How to train your own Large Language Models (link)
Far Away Times: How To Make Good Small Games (link)
Learning to Program with Natural Language (arxiv)
Projects to Keep an Eye On 🛠
h2oai/h2o-llmstudio: H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs (Github)
danielgross/LlamaAcademy: A school for camelids (Github)
suno-ai/bark: 🔊 Text-Prompted Generative Audio Model (Github)
The Latest in AI Research 💡
Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field (len-li.github.io)
Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs (arxiv)
UniverSeg: Universal Medical Image Segmentation (arxiv)
Farm3D: Learning Articulated 3D Animals by Distilling 2D Diffusion
The World Outside of AI 🌎
Amazon launches program to identify and track counterfeiters (Yahoo)
Growth of California's warehousing industry brings concerns - (Marketplace)
You'd Be Happier Living Closer to Friends. Why Don't You? (substack.com)
When Apple Comes Calling, ‘It’s the Kiss of Death’ - (WSJ)