ML Cult
A curated podcast covering the latest machine learning developments. The text and audio are generated using AI.
Episodes
75 episodes
October 27th, 2023 - AI Unleashed: Decoding Sycophancy, Mastering Control, and Crafting 3D Realities
Towards Understanding Sycophancy in Language Models; Controlled Decoding from Language Models
•
8:32
October 26th, 2023 - Frontiers of AI: From Quantum Compression to Visionary Transformers
LLM-FP4: 4-Bit Floating-Point Quantized Transformers; Detecting Pretraining Data from Large Language Models
•
14:15
October 25th, 2023 - Pixel to Perception: Matryoshka Synthesis, GPT-3's Linguistic Mysteries, Woodpecker's Visual Refinement, and SAM-CLIP's Vision Evolution
Matryoshka Diffusion Models; Dissecting In-Context Learning of Translations in GPTs; Woo...
•
11:12
October 24th, 2023 - Neural Visions Unveiled: From FreeNoise's Video Clarity, HallusionBench's Reality Check, to FlashEdit's Instant Image Refinements
FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling; HallusionBench: You See What You Think? Or You Think What You See? An Im...
•
6:35
October 23rd, 2023 - Unlocking AI's Potential: From Open Waters to Self-Enhancing Miniature Models
H2O Open Ecosystem for State-of-the-art Large Language Models; Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language ...
•
6:35
October 4th, 2023 - NeuroFrontiers: Pensive Processors, Natural Evolution, and the New Age of Linguistic Titans
Think before you speak: Training Language Models With Pause Tokens; Towards Self-Assembling Artificial Neural Networks through Neural Developm...
•
13:09
October 3rd, 2023 - Evolution in Text: Self-Improvement, Synthesis, and Scrutiny
Enable Language Models to Implicitly Learn Self-Improvement From Data; PixArt-alpha: Fast Training of Diffusion Transformer for Photorealistic...
•
7:51
October 2nd, 2023 - Math to Motion: ToRA, Decaf, and DRaFT Transformations
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving; Decaf: Monocular Deformation Capture for Face and Hand Interactions...
•
6:52
September 29th, 2023 - Masters of AI Metamorphosis: From Long-Context Linguistics to 3D Dreamscapes
Effective Long-Context Scaling of Foundation Models; Demystifying CLIP Data; Vision Tran...
•
16:14
September 28th, 2023 - Neural Vistas & Visual Alchemy: From NeuRBF Reconstructions to ScalarSimplicity in AI Imagery
NeuRBF: A Neural Fields Representation with Adaptive Radial Basis Functions; Emu: Enhancing Image Generation Models Using Photogenic Needles i...
•
8:49
September 27th, 2023 - Beyond Boundaries: Pioneering Sequences, Alignments, and Realism in AI Evolution
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models; Aligning Large Multimodal Models wi...
•
6:30
September 25th, 2023 - From Pixels to Precedents: Pioneering Visions in Color, Law, Code, and Sight
CoRF: Colorizing Radiance Fields using Knowledge Distillation; The Cambridge Law Corpus: A Corpus for Legal AI Research
•
10:47
September 22nd, 2023 - Revolutionary Speeds & Precision: The Future of Neural Networks and Language Models
Parallelizing non-linear sequential models over the sequence length; Fast Feedforward Networks
•
13:54
September 21st, 2023 - Neural Frontiers: From FreeU's Image Mastery to Languini Kitchen's Equalized Research
FreeU: Free Lunch in Diffusion U-Net; Neurons in Large Language Models: Dead, N-gram, Positional
•
14:32
September 20th, 2023 - From Overthinking Graphs to Code Whispering and Polyglot AI: The New Frontiers of Neural Networks, Language Models, and Data Compression
Graph Neural Networks Use Graphs When They Shouldn't; Large Language Models for Compiler Optimization
•
13:20
September 12th, 2023 - Frontiers in AI: From Pint-Sized Powerhouses and Pruned Datasets to Multilingual Mastery and Image Restoration
Textbooks Are All You Need II: phi-1.5 technical report; DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior...
•
11:11
September 11th, 2023 - Neural Frontiers: Audiobooks, Virtual Cities, Summarization, and Vision Transformers Reimagined
Large-Scale Automatic Audiobook Creation; CityDreamer: Compositional Generative Model of Unbounded 3D Cities
•
9:26
September 8th, 2023 - Unlocking the Future of AI: From Master Optimizers and Budget-Friendly Giants to Truthful Decoding and Video Segmentation Breakthroughs
Large Language Models as Optimizers; FLM-101B: An Open LLM and How to Train It with $100K Budget
•
11:28
September 7th, 2023 - SLiMe, Matcha-TTS, RoboSense, and CM3Leon: Revolutionizing Vision, Speech, and Multi-Modal Intelligence for a Smarter, Faster Future
SLiMe: Segment Like Me; Matcha-TTS: A fast TTS architecture with conditional flow matching
•
8:11
September 6th, 2023 - Unlocking the Future of AI: Lean Transformers, Memory-Efficient RLHF, Voice-Altering Text Prompts, and 3D Virtual Humans
One Wide Feedforward is All You Need; Efficient RLHF: Reducing the Memory Usage of PPO...
•
8:02
September 5th, 2023 - Frontiers in AI Efficiency and Capability: From Turbocharged Transformers and Extended Contexts to High-Definition Video Generation and Self-Tuned Learning
Fast Inference from Transformers via Speculative Decoding; YaRN: Efficient Context Window Extension of Large Language Models
•
9:59
September 1st, 2023 - Unlocking Multilingual AI & Beyond: Innovations in Data, Synthesis, Bioinformatics, and 3D Creation
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants; Any-Size-Diffusion: Toward Efficient Text-Driven Sy...
•
10:46
August 31st, 2023 - Advancing Weather Forecasts, Robotic Learning, and AI Conversations: A Trilogy of Innovation
WeatherBench 2: A benchmark for the next generation of data-driven global weather models; RoboTAP: Tracking Arbitrary Points for Few-Shot Visu...
•
6:44