ML Cult

October 27th, 2023 - AI Unleashed: Decoding Sycophancy, Mastering Control, and Crafting 3D Realities

Towards Understanding Sycophancy in Language ModelsControlled Decoding from Language Models

October 27, 2023 • 8:32

October 26th, 2023 - Frontiers of AI: From Quantum Compression to Visionary Transformers

LLM-FP4: 4-Bit Floating-Point Quantized TransformersDetecting Pretraining Data from Large Language Models

October 26, 2023 • 14:15

October 25th, 2023 - Pixel to Perception: Matryoshka Synthesis, GPT-3's Linguistic Mysteries, Woodpecker's Visual Refinement, and SAM-CLIP's Vision Evolution

Matryoshka Diffusion ModelsDissecting In-Context Learning of Translations in GPTsWoo...

October 25, 2023 • 11:12

October 24th, 2023 - Neural Visions Unveiled: From FreeNoise's Video Clarity, HallusionBench's Reality Check, to FlashEdit's Instant Image Refinements

FreeNoise: Tuning-Free Longer Video Diffusion Via Noise ReschedulingHallusionBench: You See What You Think? Or You Think What You See? An Im...

October 24, 2023 • 6:35

October 23th, 2023 - Unlocking AI's Potential: From Open Waters to Self-Enhancing Miniature Models

H2O Open Ecosystem for State-of-the-art Large Language ModelsLet's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language ...

October 23, 2023 • 6:35

October 4th, 2023 - NeuroFrontiers: Pensive Processors, Natural Evolution, and the New Age of Linguistic Titans

Think before you speak: Training Language Models With Pause TokensTowards Self-Assembling Artificial Neural Networks through Neural Developm...

October 04, 2023 • 13:09

October 3nd, 2023 - Evolution in Text: Self-Improvement, Synthesis, and Scrutiny

Enable Language Models to Implicitly Learn Self-Improvement From DataPixArt-alpha: Fast Training of Diffusion Transformer for Photorealistic...

October 03, 2023 • 7:51

October 2nd, 2023 - Math to Motion: ToRA, Decaf, and DRaFT Transformations

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem SolvingDecaf: Monocular Deformation Capture for Face and Hand Interactions<...

October 02, 2023 • 6:52

September 29th, 2023 - Masters of AI Metamorphosis: From Long-Context Linguistics to 3D Dreamscapes

Effective Long-Context Scaling of Foundation ModelsDemystifying CLIP DataVision Tran...

September 29, 2023 • 16:14

September 28th, 2023 - Neural Vistas & Visual Alchemy: From NeuRBF Reconstructions to ScalarSimplicity in AI Imagery

NeuRBF: A Neural Fields Representation with Adaptive Radial Basis FunctionsEmu: Enhancing Image Generation Models Using Photogenic Needles i...

September 28, 2023 • 8:49

September 27th, 2023 - Beyond Boundaries: Pioneering Sequences, Alignments, and Realism in AI Evolution

DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer ModelsAligning Large Multimodal Models wi...

September 27, 2023 • 6:30

September 25th, 2023 - From Pixels to Precedents: Pioneering Visions in Color, Law, Code, and Sight

CoRF : Colorizing Radiance Fields using Knowledge DistillationThe Cambridge Law Corpus: A Corpus for Legal AI Research

September 25, 2023 • 10:47

September 22th, 2023 - Revolutionary Speeds & Precision: The Future of Neural Networks and Language Models

Parallelizing non-linear sequential models over the sequence lengthFast Feedforward Networks

September 22, 2023 • 13:54

September 21th, 2023 - Neural Frontiers: From FreeU's Image Mastery to Languini Kitchen's Equalized Research

FreeU: Free Lunch in Diffusion U-NetNeurons in Large Language Models: Dead, N-gram, Positional

September 21, 2023 • 14:32

September 20th, 2023 - From Overthinking Graphs to Code Whispering and Polyglot AI: The New Frontiers of Neural Networks, Language Models, and Data Compression

Graph Neural Networks Use Graphs When They Shouldn'tLarge Language Models for Compiler Optimization

September 20, 2023 • 13:20

September 12th, 2023 - Frontiers in AI: From Pint-Sized Powerhouses and Pruned Datasets to Multilingual Mastery and Image Restoration

Textbooks Are All You Need II: phi-1.5 technical reportDiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior...

September 12, 2023 • 11:11

September 11th, 2023 - Neural Frontiers: Audiobooks, Virtual Cities, Summarization, and Vision Transformers Reimagined

Large-Scale Automatic Audiobook CreationCityDreamer: Compositional Generative Model of Unbounded 3D Cities

September 11, 2023 • 9:26

September 8th, 2023 - Unlocking the Future of AI: From Master Optimizers and Budget-Friendly Giants to Truthful Decoding and Video Segmentation Breakthroughs

Large Language Models as OptimizersFLM-101B: An Open LLM and How to Train It with $100K Budget

September 08, 2023 • 11:28

September 7th, 2023 - SLiMe, Matcha-TTS, RoboSense, and CM3Leon: Revolutionizing Vision, Speech, and Multi-Modal Intelligence for a Smarter, Faster Future

SLiMe: Segment Like MeMatcha-TTS: A fast TTS architecture with conditional flow matching

September 07, 2023 • 8:11

September 6th, 2023 - Unlocking the Future of AI: Lean Transformers, Memory-Efficient RLHF, Voice-Altering Text Prompts, and 3D Virtual Humans

One Wide Feedforward is All You NeedEfficient RLHF: Reducing the Memory Usage of PPO...

September 06, 2023 • 8:02

September 5th, 2023 - Frontiers in AI Efficiency and Capability: From Turbocharged Transformers and Extended Contexts to High-Definition Video Generation and Self-Tuned Learning

Fast Inference from Transformers via Speculative DecodingYaRN: Efficient Context Window Extension of Large Language Models

September 05, 2023 • 9:59

September 1st, 2023 - Unlocking Multilingual AI & Beyond: Innovations in Data, Synthesis, Bioinformatics, and 3D Creation

The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language VariantsAny-Size-Diffusion: Toward Efficient Text-Driven Sy...

September 01, 2023 • 10:46

August 31th, 2023 - Advancing Weather Forecasts, Robotic Learning, and AI Conversations: A Trilogy of Innovation

WeatherBench 2: A benchmark for the next generation of data-driven global weather modelsRoboTAP: Tracking Arbitrary Points for Few-Shot Visu...

August 31, 2023 • 6:44

August 30th, 2023 - From AI Planners to 3D Faces: Groundbreaking Innovations in Machine Learning and Digital Media

Reward-Respecting Subtasks for Model-Based Reinforcement LearningRelightify: Relightable 3D Faces from a Single Image via Diffusion Models

August 30, 2023 • 7:55

August 29th, 2023 - Concept Dissection, Alignment Pitfalls, and Responsible AI: Pioneering Approaches in Image Generation, Language Modeling, and Healthcare

Break-A-Scene: Extracting Multiple Concepts from a Single ImageThe Poison of Alignment

August 29, 2023 • 8:53

ML Cult

Episodes

October 27th, 2023 - AI Unleashed: Decoding Sycophancy, Mastering Control, and Crafting 3D Realities

October 26th, 2023 - Frontiers of AI: From Quantum Compression to Visionary Transformers

October 25th, 2023 - Pixel to Perception: Matryoshka Synthesis, GPT-3's Linguistic Mysteries, Woodpecker's Visual Refinement, and SAM-CLIP's Vision Evolution

October 24th, 2023 - Neural Visions Unveiled: From FreeNoise's Video Clarity, HallusionBench's Reality Check, to FlashEdit's Instant Image Refinements

October 23th, 2023 - Unlocking AI's Potential: From Open Waters to Self-Enhancing Miniature Models

October 4th, 2023 - NeuroFrontiers: Pensive Processors, Natural Evolution, and the New Age of Linguistic Titans

October 3nd, 2023 - Evolution in Text: Self-Improvement, Synthesis, and Scrutiny

October 2nd, 2023 - Math to Motion: ToRA, Decaf, and DRaFT Transformations

September 29th, 2023 - Masters of AI Metamorphosis: From Long-Context Linguistics to 3D Dreamscapes

September 28th, 2023 - Neural Vistas & Visual Alchemy: From NeuRBF Reconstructions to ScalarSimplicity in AI Imagery

September 27th, 2023 - Beyond Boundaries: Pioneering Sequences, Alignments, and Realism in AI Evolution

September 25th, 2023 - From Pixels to Precedents: Pioneering Visions in Color, Law, Code, and Sight

September 22th, 2023 - Revolutionary Speeds & Precision: The Future of Neural Networks and Language Models

September 21th, 2023 - Neural Frontiers: From FreeU's Image Mastery to Languini Kitchen's Equalized Research

September 20th, 2023 - From Overthinking Graphs to Code Whispering and Polyglot AI: The New Frontiers of Neural Networks, Language Models, and Data Compression

September 12th, 2023 - Frontiers in AI: From Pint-Sized Powerhouses and Pruned Datasets to Multilingual Mastery and Image Restoration

September 11th, 2023 - Neural Frontiers: Audiobooks, Virtual Cities, Summarization, and Vision Transformers Reimagined

September 8th, 2023 - Unlocking the Future of AI: From Master Optimizers and Budget-Friendly Giants to Truthful Decoding and Video Segmentation Breakthroughs

September 7th, 2023 - SLiMe, Matcha-TTS, RoboSense, and CM3Leon: Revolutionizing Vision, Speech, and Multi-Modal Intelligence for a Smarter, Faster Future

September 6th, 2023 - Unlocking the Future of AI: Lean Transformers, Memory-Efficient RLHF, Voice-Altering Text Prompts, and 3D Virtual Humans

September 5th, 2023 - Frontiers in AI Efficiency and Capability: From Turbocharged Transformers and Extended Contexts to High-Definition Video Generation and Self-Tuned Learning

September 1st, 2023 - Unlocking Multilingual AI & Beyond: Innovations in Data, Synthesis, Bioinformatics, and 3D Creation

August 31th, 2023 - Advancing Weather Forecasts, Robotic Learning, and AI Conversations: A Trilogy of Innovation

August 30th, 2023 - From AI Planners to 3D Faces: Groundbreaking Innovations in Machine Learning and Digital Media

August 29th, 2023 - Concept Dissection, Alignment Pitfalls, and Responsible AI: Pioneering Approaches in Image Generation, Language Modeling, and Healthcare