Scientists should use AI as a tool, not an oracle
How AI hype leads to flawed research that fuels more hype
OpenAI Racks Up Data and Content Deals with News Corp, Vox, and The Atlantic
More high-quality data for OpenAI, less incentive to give into NYT demands
Data Machina #255
New Trends in AI-RAG and Graphs. GRAG. GNN-RAG. Property Graph. Unified RAG+LangGraph. GenAI Mindset. Transformer Agents 2.0. Falcon 2.0 11B LLMS/ VLMS. ToonCrafter. MusePose. ColdFusion. SymbCoT.
Developing an LLM: Building, Training, Finetuning
This is an overview of the LLM development process. This one-hour talk focuses on the essential three stages of developing an LLM: coding the architecture…
LLM Research Insights: Instruction Masking and New LoRA Finetuning Experiments?
This article covers three new papers related to instruction finetuning and parameter-efficient finetuning with LoRA in large language models (LLMs). I work…
Google Gemini 1.5 and Flash LLMs Show Significant Advances Hidden in Research
Progress in math, instruction following, and long-tail expertise
Scale AI’s $1B Funding Round Highlights a New Phase in the Data and AI Wars
The shift from data volume to data quality is well underway
Replicate Intelligence #2
Faster image generation, AI-powered world simulator, insights on AI dataset complexity