Hacker News with Generative AI: Foundation Models

Roblox Releases Cube: Generative AI System for 3D (github.com/Roblox)
Foundation models trained on vast amounts of data have demonstrated remarkable reasoning and generation capabilities in the domains of text, images, audio and video. Our goal is to build such a foundation model for 3D intelligence, a model that can support developers in producing all aspects of a Roblox experience, from generating 3D objects and scenes to rigging characters for animation to producing programmatic scripts describing object behaviors.
Roblox releases code for Cube 3D Mesh Gen model (github.com/Roblox)
All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning (arxiv.org)
From a first-principles perspective, it may seem odd that the strongest results in foundation model fine-tuning (FT) are achieved via a relatively complex, two-stage training procedure.
Magma: A foundation model for multimodal AI agents (microsoft.github.io)
Magma is the first foundation model that is capable of interpreting and grounding multimodal inputs within its environment. Given a described goal, Magma is able to formulate plans and execute actions to achieve it. By effectively transferring knowledge from freely available visual and language data, Magma bridges verbal, spatial and temporal intelligence to navigate complex tasks and settings.
Automated Capability Discovery via Foundation Model Self-Exploration (arxiv.org)
Foundation models have become general-purpose assistants, exhibiting diverse capabilities across numerous domains through training on web-scale data.
Emerging AI Agent Architecture for Fintech with DeepSeek Foundation Models (ycombinator.com)
Folks, we're defining a new AI Agent architecture for Fintech to support different foundation models.
Oumi-AI/oumi: Everything you need to build foundation models, end-to-end (github.com/oumi-ai)
Oumi is a fully open-source platform that streamlines the entire lifecycle of foundation models - from data preparation and training to evaluation and deployment.
Show HN: TabPFN v2 – A SOTA foundation model for small tabular data (nature.com)
Tabular data, spreadsheets organized in rows and columns, are ubiquitous across scientific fields, from biomedicine to particle physics to economics and climate science.
Automating the search for artificial life with foundation models (sakana.ai)
For the past 300,000 years, Earth has had only one form of advanced intelligence on it: humans. With the recent advent of AI foundation models, some believe we are at the dawn of a new kind of intelligence.
Amazon Nova (amazon.com)
Today, we’re thrilled to announce Amazon Nova, a new generation of state-of-the-art foundation models (FMs) that deliver frontier intelligence and industry leading price performance, available exclusively in Amazon Bedrock.
Nucleotide Transformer: building robust foundation models for human genomics (nature.com)
The prediction of molecular phenotypes from DNA sequences remains a longstanding challenge in genomics, often driven by limited annotated data and the inability to transfer learnings between tasks.
Show HN: Foundation models for time series forecasting (github.com/wearesulie)
The Sulie SDK offers seamless integration with the Sulie platform for advanced time series forecasting powered by Mimosa—a transformer-based foundation model optimized specifically for time series data.
Depth Pro: Sharp monocular metric depth in less than a second (github.com/apple)
We present a foundation model for zero-shot metric monocular depth estimation. Our model, Depth Pro, synthesizes high-resolution depth maps with unparalleled sharpness and high-frequency details.
Domain-Aware Fine-Tuning of Foundation Models (arxiv.org)
On Open-Weights Foundation Models (ftc.gov)
Meta Large Language Model Compiler: Foundation Models of Compiler Optimization (meta.com)
Apple's On-Device and Server Foundation Models (apple.com)
IBM Granite: A Family of Open Foundation Models for Code Intelligence (github.com/ibm-granite)
OpenEQA: Embodied Question Answering in the Era of Foundation Models (open-eqa.github.io)
Rerank 3: A new foundation model for efficient enterprise search and retrieval (cohere.com)