Data News — Week 23.46

Data News #23.46: Sam Altman has been fired as CEO of OpenAI, all the AI news, and catching up on the news from the last month.

Christophe Blefari
6 min read
Back in town (credits)

Hey, it's been a few weeks since I last wrote any news. It was a necessary break for me and a case of blank page syndrome at the same time. Still, I've accumulated a lot of articles that I think should fit in the Data News, so this week might be a huge recap of content that has been produced in the last month.

I hope you will enjoy the selection.

On Monday I'll also give a talk at the Berlin MotherDuck meetup: DuckDB experiments, a glimpse of the future. I don't think it will be streamed live, but the recording should be published on YouTube after the event.

Not sure there are still free seats, but if you want to come, reach out to me.

AI News 🤖

  • Sam Altman has been fired as CEO of OpenAI.
    • OpenAI announced this leadership transition yesterday. At the same time, Greg Brockman (current President and co-founder) will step down as chairman of the board and Mira Murati (current CTO) will become interim CEO. It was a brutal decision.
    • The official reason given publicly was "[Sam] was not consistently candid in his communications with the board, hindering its ability to exercise its responsibilities. The board no longer has confidence."
    • The Internet has spent the last 15 hours guessing what this really means. Here are a few theories I've read: a security leak occurred and Sam/Greg hid it from the board; Sam is publicly accused of sexual abuse by his sister; Sam has views on the company vision that displease the board, especially regarding profits or AI regulation; Sam invested in an OpenAI competitor. Either way, we'll see in a few days.
    • People are mostly saddened by the news because Sam was a publicly beloved and transparent CEO who changed AI. Many compare it with the coup that overthrew Steve Jobs back in the day.
  • The news arrived a few days after OpenAI DevDay, a public conference announcing new products and features. The main announcement was GPTs, a no-code UI to create custom versions of ChatGPT.
  • Other AI announcements
    • GitHub Universe was the occasion to announce more Copilot everywhere in the GitHub ecosystem. The most interesting thing was that GitHub will introduce M1 and GPU runners.
    • xAI, the company founded by Musk after he quit OpenAI, announced Grok. It's a 33B-parameter LLM.
    • Germany wants to build the European OpenAI competitor and invested $500m in Aleph Alpha, a startup. On the landing page it's clear that the focus is to build safe AI.
    • Kyutai was announced at the AI Pulse event at Station F, Paris. Kyutai is an open science lab that aims to build and democratize AGI (artificial general intelligence) through open science. They carefully picked open science rather than open-source. The team looks great.
    • The GPU availability competition is on. Y Combinator announced a Microsoft partnership and priority access to compute resources. This is also linked to Microsoft making custom AI chips.
    • Biden issues executive order on safe, secure, and trustworthy AI.
  • Two reports with hundreds of pages about AI were published: The State of AI report and AI: The Coming Revolution. Both look full of interesting things, but I have not read them.
  • A Google team wrote a paper "demonstrating various failure modes of transformers and degradation of their generalization for even simple extrapolation tasks". In a nutshell, LLMs can't generalize.
🍿 (© Silicon Valley HBO series)

Now that I've given you the general news, let's jump to a few AI use-cases.

Fast News ⚡️

Because the AI News is pretty packed and I still want you to enjoy this newsletter, articles will get less commentary than usual. But you'll still get spicy opinions, because you know, it's me.

  • Data contracts are undoubtedly a new growth lever for data observability companies and data VCs. Soda announced their open-source data contracts engine. It's done in YAML. Here is another example of contracts with msgspec.
  • NVIDIA has been able to supercharge pandas with cuDF to run pandas code on GPUs.
  • Wes McKinney, the creator of pandas and Arrow, will join Posit, the company behind RStudio, as a Principal Architect. His new role will probably ease the integration of all the Python tooling into the Posit ecosystem, even if that integration has already been underway for months.
  • dbt Labs hired Brandon Sweeney as its new President and COO. Brandon previously handled revenue at HashiCorp, the same company that recently changed its licensing to BSL and drew backlash from the tech community for it. Our prayers go to dbt Core.
  • Onehouse, Microsoft and Google are working on a table format standard called OneTable. This isn't a new format but a way to create interoperability between Delta, Iceberg and Hudi.
  • If you are curious about Iceberg and Hudi ACID guarantees read the article.
  • Code faster with Ruff, a Python formatter written in Rust. All the time wasted waiting for black to reformat your code can now be put to good use.

Taking other companies as examples is often a good way to get ideas.

A few food-for-thought articles about data concepts and roles.

Data Economy 💰

  • ZenML raises an additional $3.7m in seed funding. It is an MLOps platform that works with all clouds and tools.
  • Snowflake acquires Sisu and Ponder. The first one is an engine to monitor business metrics while the second is a tool to run pandas at scale.
  • Yahoo spins out Vespa, which raises $31m. Vespa is a search engine and a vector database. The timing is good, given the demand from AI use-cases.
  • Aleph Alpha raises $500m Series B to build the German OpenAI.
  • Kyutai is funded with $330m from 2 French billionaires and Eric Schmidt, the ex-Google CEO. Kyutai is an open science lab that wants to build AGI. The team has a good resume and the science committee looks awesome (Yejin Choi, Yann LeCun and Bernhard Schölkopf).
Dreaming of sun (credits)

Ghost implemented a recommendation feature recently, so I've added a few folks I like to read on the internet.

See you next week ❤️.

Christophe Blefari

Staff Data Engineer. I like 🚲, 🪴 and 🎮. I can do everything with data, just ask.
