AI Infrastructure - Provide.ai

ai data centers, AI Funding & Investment, AI Infrastructure, AI Startups, quantum AI, quantum computing, Sygaldry Technologies

Sygaldry Raises $139M for Quantum AI Infrastructure

Mohib Rehman / May 18, 2026

Insider Brief: Sygaldry Technologies, Inc. announced today that it has raised $139M in Series A and Seed financing to build quantum-accelerated AI servers. The $105M Series A in March 2026 was led by Breakthrough Energy Ventures, following a $34M seed round led by Initialized Capital. Sygaldry’s servers aim to exponentially speed up critical AI algorithms, […]

AI Infrastructure, AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, context-engineering, deep-learning, Editors Pick, Language Model, Large Language Model, Machine Learning, software-engineering, Staff, Tech News, Technology

Nous Research Proposes Lighthouse Attention: A Training-Only Selection-Based Hierarchical Attention That Delivers 1.4–1.7× Pretraining Speedup at Long Context

Asif Razzaq / May 16, 2026

Nous Research has published Lighthouse Attention, a selection-based hierarchical attention mechanism that wraps around standard scaled dot-product attention during pretraining and is removed afterward. Unlike prior methods such as NSA and HISA that pool only keys and values, Lighthouse pools Q, K, and V symmetrically across a multi-resolution pyramid, reducing the attention call from O(N·S·d) to O(S²·d) and running stock FlashAttention on a small dense sub-sequence. Tested on a 530M Llama-3-style model at 98K context, it achieves a 1.40–1.69× end-to-end wall-clock speedup against a cuDNN SDPA baseline with matching or lower final training loss.
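The complexity claim above can be made concrete with a little arithmetic. The following is a minimal sketch, not the paper's cost model: it counts only the multiply-accumulates of the attention score matrix, and the sub-sequence length `S`, the head dimension `d`, and the exact value taken for "98K" are all assumed for illustration.

```python
def attention_cost(num_queries: int, num_keys: int, head_dim: int) -> int:
    """Multiply-accumulate count for the query-key score matrix.

    Constant factors and the softmax/value product are ignored, so this
    only tracks the asymptotic terms quoted in the summary.
    """
    return num_queries * num_keys * head_dim


N = 98_304  # full context length; "98K" in the summary, exact value assumed
S = 8_192   # hypothetical length of the pooled dense sub-sequence
d = 128     # hypothetical head dimension

# Pooling only keys and values: all N queries attend to S selected keys.
kv_only = attention_cost(N, S, d)    # O(N * S * d)

# Pooling Q, K, and V symmetrically: S queries attend to S keys.
symmetric = attention_cost(S, S, d)  # O(S^2 * d)

print(f"KV-only pooling:   {kv_only:,} MACs")
print(f"Symmetric pooling: {symmetric:,} MACs")
print(f"Ratio (equals N / S): {kv_only / symmetric:.1f}")
```

Under these assumed values the attention call itself shrinks by a factor of N / S. That is not the same as the reported 1.40–1.69× end-to-end speedup, which also includes every other part of the training step.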

Originally published on MarkTechPost.

Agentic AI, AI Infrastructure, AI Shorts, Artificial Intelligence, Editors Pick, generative-ai, Language Model, Machine Learning, New Releases, Open Source, software-engineering, Staff, Tech News, Technology

Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production

Asif Razzaq / May 16, 2026

Running AI agents in a local script is straightforward. Running them reliably in production across teams, across restarts, with isolated environments per context is a different problem entirely. BerriAI, the company behind the LiteLLM AI Gateway, is now open-sourcing a purpose-built answer to that problem: the LiteLLM Agent Platform. The platform is described as a […]

Originally published on MarkTechPost.

AI Infrastructure, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Large Language Model, Machine Learning, software-engineering, Staff, Tech News, Technology

Zyphra Releases ZAYA1-8B-Diffusion-Preview: The First MoE Diffusion Model Converted From an Autoregressive LLM With Up to 7.7x Speedup

Asif Razzaq / May 15, 2026

Zyphra’s latest release shows that an autoregressive MoE model can be converted into a discrete diffusion model with no systematic loss in evaluation performance. ZAYA1-8B-Diffusion-Preview achieves up to 7.7x inference speedup over autoregression by shifting decoding from memory-bandwidth bound to compute-bound — a key advantage as modern GPUs continue scaling FLOPs faster than memory bandwidth.

Originally published on MarkTechPost.

AI Infrastructure, Artificial Intelligence, Inference, llm, Machine Learning

DFlash: The Trick That Makes LLMs Stop Crawling One Token at a Time

ABV — Applied AI Reviews / May 15, 2026

Speculative decoding was already clever. DFlash makes the draft stage parallel, turning diffusion from a clumsy text generator into a very… (Continue reading on Medium.)

AI hardware, AI Infrastructure, ai-architecture, automation news, autonomous systems, Communications, humanoid market, humanoid robotics, humanoid robots, Humanoids, industrial ai, industrial robotics, mipi alliance, Physical AI, physical AI systems, robot sensors, robotics and automation, robotics and automation news, robotics ecosystem, robotics industry, robotics interfaces, robotics news, robotics standards, robotics technology, semiconductor industry, Technology, Yole Group

MIPI Alliance launches ‘Physical AI Birds of a Feather group’ focused on humanoids

David Edwards / May 15, 2026

The MIPI Alliance, an international organization that develops specifications that standardize wired interfaces for mobile and other connected ecosystems, has announced the formation of a “Physical AI Birds of a Feather (BoF) group” dedicated to explor…

AI Infrastructure, Artificial Intelligence, Machine Learning, product-engineering

Compound Latency: The Problem That Doesn't Show Up in Any Benchmark

Dayanna Verónica / May 14, 2026

There is a very specific moment in any real-time AI product that doesn't show up in any benchmark. It isn't in the dashboards, it isn't… (Continue reading on Medium.)

Agentic AI, AI Infrastructure, AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Large Language Model, Machine Learning, New Releases, Staff, Tech News, Technology

Nous Research Releases Token Superposition Training to Speed Up LLM Pre-Training by Up to 2.5x Across 270M to 10B Parameter Models

Asif Razzaq / May 14, 2026

Nous Research releases Token Superposition Training (TST), a two-phase pre-training method that cuts wall-clock training time by up to 2.5x at matched FLOPs by averaging contiguous token embeddings into bags during Phase 1 and reverting to standard next-token prediction in Phase 2 — without changing the model architecture, tokenizer, optimizer, or inference-time behavior. Validated at 270M, 600M, 3B dense, and 10B-A1B MoE scales.
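The Phase 1 step described above, averaging contiguous token embeddings into bags, can be sketched in a few lines. This is an illustration only, not Nous Research's implementation: the function name `bag_embeddings`, the bag size, and the handling of a short trailing bag are assumptions, and the summary does not say what training target the bags are paired with.

```python
def bag_embeddings(embeddings, bag_size):
    """Average each run of `bag_size` contiguous embeddings into one vector.

    `embeddings` is a list of equal-length lists of floats, one per token.
    A short trailing group is averaged over the tokens it actually holds
    (an assumption; the summary does not cover this case).
    """
    if bag_size < 1:
        raise ValueError("bag_size must be at least 1")
    bags = []
    for start in range(0, len(embeddings), bag_size):
        group = embeddings[start:start + bag_size]
        dim = len(group[0])
        # Element-wise mean across the tokens in this bag.
        bags.append([sum(vec[i] for vec in group) / len(group) for i in range(dim)])
    return bags


# Four toy 2-dimensional token embeddings, bagged in pairs.
tokens = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
print(bag_embeddings(tokens, 2))  # [[2.0, 3.0], [6.0, 7.0]]
```

With a bag size of 2 the model sees half as many positions per sequence in this phase. A bag size of 1 leaves the embeddings unchanged, which corresponds to the standard per-token input used in Phase 2.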

Originally published on MarkTechPost.

AI Infrastructure, ai-ethics, ai-governance, Applications, Artificial Intelligence, Editors Pick, enterprise-ai, Staff, Technology

Enterprise AI Governance in 2026: Why the Tools Employees Use Are Ahead of the Policies That Cover Them

Michal Sutter / May 13, 2026

63% of organizations have no AI governance policy. Shadow AI is already running inside your stack — here is the data.

Originally published on MarkTechPost.

Agentic AI, AI Infrastructure, AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, New Releases, Open Source, Security, software-engineering, Staff, Tech News, Technology

Fastino Labs Open-Sources GLiGuard: A 300M Parameter Safety Moderation Model That Matches or Exceeds Accuracy of Models 23–90x Its Size

Asif Razzaq / May 13, 2026

Fastino Labs has released GLiGuard, a 300M parameter open-source safety moderation model that evaluates four safety tasks — prompt safety, jailbreak strategy detection, harm category classification, and refusal detection — in a single forward pass. Built on an encoder architecture rather than the decoder-only design used by most guardrail models, GLiGuard achieves up to 16x higher throughput and 16.6x lower latency than current state-of-the-art models, while matching or exceeding the accuracy of models 23 to 90 times its size across nine safety benchmarks. Model weights are available under the Apache 2.0 license on Hugging Face.

Originally published on MarkTechPost.