Data News — Week 24.37

Data News #24.37 — OpenAI o1 new series, building low cost platform with Model dlt and dbt, Data teams survey, feature store, Ibis without pandas.

Christophe Blefari
Back to work (credits)

Hey you, can you believe it's already September? This year has been flying. It feels like I just blinked, and here we are. In August, I focused mainly on my next big journey. If you follow me on LinkedIn, you might have caught a sneak peek! I'll be making a full announcement next week. I want to take the time to explain my thought process and the ideas behind it. I hope you will like it.

Below is the Data News wrapping up the summer and the first two weeks of September.

AI News 🤖

  • OpenAI released 2 new models, OpenAI o1-preview and o1-mini: these models bring changes and a break in the model naming. OpenAI decided to give up on the GPT naming, which means GPT-5 will never be plugged in. The GPT paper was co-authored by 4 people, 3 of whom are no longer at OpenAI; leaving the GPT name behind also marks a change in paradigm.

    The o1 series brings more “reasoning”; it looks like a pre-prompt that runs a chain of thought on top of what they already did best. Lots of stories about exceptional things the model can do have been published today. For example, the OpenAI system card explains that during a cybersecurity challenge (a CTF) the model was able to understand that the Docker environment was failing (due to infra) and still find the flag.

    Here is a YouTube playlist demonstrating o1's capabilities.

    As clem mentioned on Twitter, it's always important to pay attention to words: even if the model “reasons”, it doesn't think, it processes.
  • More news about OpenAI
    • The new models are already available on Azure; but be careful, Microsoft's open-source Phi-3.5-mini is out.
    • Ilya Sutskever, previously Chief Scientist at OpenAI, raised $1b to co-found Safe Superintelligence with a manifesto.
    • Alexis Conneau, Her ex-research lead at OpenAI, decided to create a new company and got a lot of Tweet impressions. Former OpenAI members are quite popular when it comes to founding companies.
    • Bloomberg reported that OpenAI seeks to raise $11.5b more at a $150b valuation, making it the third private company in terms of valuation [paywall article].
    • NEO Beta, a humanoid company backed by OpenAI, released a first video demo. And it's impressive (🙃), the robot is able to hand over a bag to a human!
    • We hope the next OpenAI model is not o7. /s
  • OpenAI and Anthropic will give their models first to the US government (NIST) to help advance safe and trustworthy AI innovation for all. But they cry that innovation is threatened when the AI Act is voted in Europe.
  • NVIDIA released Eagle, a vision-centric multimodal LLM: look at the example in the GitHub repo. Given an image and a user input, the LLM is able to answer things like "Describe the image in detail" or "Which car in the picture is more aerodynamic" based on a drawing.
  • Aleph Alpha introduced Pharia-1-LLM: it's a 7B model and the license explicitly targets non-commercial and research usage. Aleph Alpha, a German company funded by German VCs (with $500m), was trying to compete with US companies (like Mistral and OpenAI 🤭) in the model race but gave up this competition to pivot to an AI-support company for the public sector.

Fast News ⚡️

Calm data flows (credits)

See you next week ❤️

Christophe Blefari

Staff Data Engineer. I like 🚲, 🪴 and 🎮. I can do everything with data, just ask.