April 17, 2026

Glif Team

New Tools, Queued Messages & Polish

A batch of improvements since Glif 2.0: queued messages, new native tools, a smarter media library, and general polish

Since we launched Glif 2.0 a few weeks ago, we've been heads-down shipping. Here's a roundup of the public-facing changes.

Queue up messages while the agent works

You no longer have to wait for the agent to finish before typing your next instruction. Hit enter while a turn is in progress and your message joins a queue that's sent as soon as the agent's ready. You can edit or cancel queued messages before they run, and there's an indicator above the composer when you're editing one in place.
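
For readers curious how a queue like this can behave, here is a minimal sketch in TypeScript. It is not Glif's implementation: the class name, method names, and the choice to send one queued message per finished turn are all assumptions made for illustration.

```typescript
// Hypothetical sketch of a composer message queue; not Glif's actual code.
type QueuedMessage = { id: number; text: string };

class MessageQueue {
  private pending: QueuedMessage[] = [];
  private nextId = 1;
  private busy = false;
  // Stands in for messages actually delivered to the agent.
  readonly sent: string[] = [];

  // Called when the user hits enter. Returns a queue id, or null if sent at once.
  submit(text: string): number | null {
    if (!this.busy) {
      this.send(text);
      return null;
    }
    const id = this.nextId++;
    this.pending.push({ id, text });
    return id;
  }

  // Replace the text of a message that is still waiting.
  edit(id: number, text: string): boolean {
    const msg = this.pending.find((m) => m.id === id);
    if (!msg) return false;
    msg.text = text;
    return true;
  }

  // Drop a message that is still waiting.
  cancel(id: number): boolean {
    const before = this.pending.length;
    this.pending = this.pending.filter((m) => m.id !== id);
    return this.pending.length < before;
  }

  // Called when the agent finishes a turn (assumption: one queued message per turn).
  turnFinished(): void {
    this.busy = false;
    const next = this.pending.shift();
    if (next) this.send(next.text);
  }

  private send(text: string): void {
    this.busy = true;
    this.sent.push(text);
  }
}

const composerQueue = new MessageQueue();
composerQueue.submit("Make a poster");                // agent idle: sent right away
const first = composerQueue.submit("Make it blue");   // agent busy: queued
const second = composerQueue.submit("Add a logo");    // queued behind the first
if (first !== null) composerQueue.edit(first, "Make it teal");
if (second !== null) composerQueue.cancel(second);
composerQueue.turnFinished();                         // delivers the edited message
```

In this sketch, editing and cancelling only touch messages that have not been sent yet, which matches the "before they run" behavior described above.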

New native tools

A big batch of new agent capabilities across image, video, and audio:

Images

  • Z Image (with LoRA + ControlNet support) — this model punches way above its weight class. Fast, cheap, very good. Supports text-to-image, image-to-image, and ControlNet modes with optional LoRA.
  • Quiver — generate real SVGs from text or reference images. Clean, production-ready vector output, infinitely scalable. Great for logos, icons, and illustrations.
  • HTML to Image — have the agent write HTML + CSS and get back a rendered PNG. Ultra-useful for social cards, posters, infographics, and any layout you want repeatable and pixel-perfect.
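
To give a feel for the HTML to Image workflow, here is a rough sketch of the kind of self-contained markup an agent might write for a social card. The function names, the 1200x630 canvas, and the styling are assumptions for illustration; how the markup is handed to the tool is not covered here.

```typescript
// Illustrative only: builds self-contained HTML + CSS that a renderer could turn into a PNG.
// The function names and the 1200x630 size are assumptions, not part of Glif's tool.
function escapeHtml(s: string): string {
  return s
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;");
}

function socialCardHtml(title: string, subtitle: string): string {
  return `<!DOCTYPE html>
<html>
<head>
<style>
  body { margin: 0; }
  .card {
    width: 1200px; height: 630px; box-sizing: border-box; padding: 80px;
    display: flex; flex-direction: column; justify-content: center;
    background: #111827; color: #f9fafb; font-family: sans-serif;
  }
  h1 { font-size: 72px; margin: 0 0 24px; }
  p { font-size: 32px; margin: 0; opacity: 0.8; }
</style>
</head>
<body>
  <div class="card">
    <h1>${escapeHtml(title)}</h1>
    <p>${escapeHtml(subtitle)}</p>
  </div>
</body>
</html>`;
}

const cardHtml = socialCardHtml("New Tools & Polish", "What shipped since 2.0");
```

Because the layout is plain HTML and CSS with fixed pixel dimensions, the same template can be reused with different text and still come out at the same size every time.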

Video

  • PixVerse V6 — a strong all-rounder that's especially good at action scenes. Image-to-video with native audio (background music + SFX, not speech), plus style presets (anime, 3D animation, clay, comic, cyberpunk). Also available as a keyframe transition model.
  • Seedance 2.0 — ByteDance's latest text/image-to-video model with native audio, in four variants (standard, Fast, Reference, Fast Reference).
  • Remotion — have the agent write a React component and get back a fully rendered video. Programmatic, repeatable, extremely powerful and versatile.
  • Fabric 1.0 LipSync — fast, clean lip-sync from a portrait image and an audio clip. Another solid lip-sync option in the arsenal.
  • Remove Video Watermark — strips watermarks from videos via WaveSpeed, billed by duration (clips up to 10 minutes).

Audio

  • Google Gemini TTS — premium-quality text-to-speech powered by Gemini 3.1 Flash. 30 voices, 70+ languages (auto-detected), and inline audio tags for expressive control like [whispers], [excited], [laughs], and [sighs].
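
As a small illustration of inline audio tags, the sketch below assembles a script string with tags in front of individual lines. The tag names are the ones listed above; the helper, its types, and the placement of each tag before its text are assumptions made for this example.

```typescript
// Tag names come from the list above; placing a tag before its line is an assumption.
type AudioTag = "whispers" | "excited" | "laughs" | "sighs";

interface ScriptLine {
  text: string;
  tag?: AudioTag;
}

// Joins lines into one script, prefixing tagged lines with "[tag] ".
function toTaggedScript(lines: ScriptLine[]): string {
  return lines
    .map((line) => (line.tag ? `[${line.tag}] ${line.text}` : line.text))
    .join(" ");
}

const ttsScript = toTaggedScript([
  { tag: "excited", text: "We just shipped a new batch of tools!" },
  { tag: "whispers", text: "Don't tell anyone yet." },
  { text: "More soon." },
]);
```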

A better media library

Your library is easier to navigate:

  • Group by chat — the library now groups media by the chat that created it (this is the default), so you can jump back to the source conversation. Group-by-date and ungrouped views are still available.
  • Like anything — hit the heart on any output in chat or in the library to save your favorites, then flip the "Liked only" toggle in the library to filter down to them.
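
The grouping and filtering described above can be pictured with a short sketch. The `MediaItem` shape and the `groupByChat` helper are hypothetical and only meant to show the idea of grouping by source chat with an optional "Liked only" filter.

```typescript
// Hypothetical data shape; field names are assumptions for illustration.
interface MediaItem {
  id: string;
  chatId: string;
  liked: boolean;
}

// Groups items by the chat that created them, optionally keeping only liked items.
function groupByChat(items: MediaItem[], likedOnly = false): Map<string, MediaItem[]> {
  const groups = new Map<string, MediaItem[]>();
  for (const item of items) {
    if (likedOnly && !item.liked) continue;
    const group = groups.get(item.chatId) ?? [];
    group.push(item);
    groups.set(item.chatId, group);
  }
  return groups;
}

const libraryItems: MediaItem[] = [
  { id: "img-1", chatId: "chat-a", liked: true },
  { id: "vid-1", chatId: "chat-a", liked: false },
  { id: "img-2", chatId: "chat-b", liked: false },
];

const allGroups = groupByChat(libraryItems);          // default view: every item, grouped by chat
const likedGroups = groupByChat(libraryItems, true);  // "Liked only" toggle on
```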

General UI improvements

A broad polish pass across the app, plus a bunch of smaller fixes:

  • Refreshed dark mode with a cooler palette
  • Cleaner settings pages
  • More reliable video player
  • Videos in galleries now display at their true aspect ratio
  • Attached media inside chats is now laid out in a clean grid
  • Tidier chat layout for task groups, code blocks, and inline forms
  • Better markdown rendering, including tables, blockquotes, and tool-task thumbnails
  • Refreshed favicon

More coming soon. As always, drop into our Discord and let us know what you're making (or what's broken).