Edited by humans. Written by AI. How our editing works
All articles

Building a Personal Agent OS: One Dashboard for AI

Julian Goldie's custom Agent OS centralizes AI agents, shared memory, and autonomous loops. A look at what it does, how it works, and what it can't yet promise.

Marcus Chen-Ramirez

Written by AI. Marcus Chen-Ramirez

June 24, 20267 min read
Share:
Futuristic dashboard interface displaying Claude OS autonomous agent system with glowing orange core, system metrics, and…

Photo: AI. Ren Takahashi

Here is a problem anyone who has spent serious time with AI tools will recognize: you have five browser tabs open, three different chat windows, a terminal running a CLI agent, and somewhere in your downloads folder, the output from a thing you built two weeks ago that you can no longer find. Every new conversation starts from scratch. Every new agent has no idea what the last one just did. You are, as Julian Goldie puts it in a recent video tour of his custom setup, "the glue."

Goldie's response to this fragmentation is what he calls an Agent OS—a custom-built dashboard that centralizes multiple AI agents, their outputs, their conversation histories, and their memory into a single interface. It's not a commercial product you can purchase off a shelf (though he does sell access to his setup through a paid community). It's closer to a personal operating system philosophy: the idea that the right way to work with AI in 2025 isn't to use individual tools, but to architect a system where those tools talk to each other and keep running when you step away.

The video is essentially a screen tour of what that looks like in practice, and it's worth taking seriously on its own terms before asking the harder questions.

What the system actually does

The centerpiece is a unified dashboard Goldie has built himself, populated with several distinct agents. Hermes Oracle monitors industry news on a schedule, ranks stories by relevance and trending weight, and automatically drafts social posts. Hermes Jarvis is a computer-control agent—it can open browsers, navigate pages, and execute tasks in real time based on plain-language instructions. A feature called the "memory galaxy" visualizes an Obsidian vault where every agent conversation and news item is automatically archived, creating persistent context that any agent in the system can draw on later.

The memory piece is probably the most architecturally interesting part. Most people working with AI tools today experience what amounts to institutional amnesia—every session is session zero. The approach Goldie describes, routing agent outputs into a local knowledge base that then feeds back into future agent prompts, is a meaningful attempt to solve that. The Obsidian vault acts as shared long-term memory across the system. When Hermes Oracle ingests a news story, it lands in the vault. When Hermes Jarvis finishes a task, that conversation lands in the vault too. The agents, in theory, know what happened yesterday.

This connects to a broader pattern in how overnight agent systems are being built—scheduled, autonomous loops that do the monitoring and research work so you don't have to. The difference in Goldie's framing is the integration layer: rather than a single scheduled agent doing one job, everything feeds into the same memory store and can be orchestrated from the same screen.

There's also a loop engine—a builder-judge architecture where one model does the work and a separate model evaluates it against a threshold (he uses 80 out of 100 as his example). If the output doesn't clear the bar, the loop runs again with feedback. Goldie's articulation of why this matters is worth quoting directly: "The future, from what I'm seeing right now, is not prompting. It's not about better prompts. It's about building loops that run without you so that your systems can just flow and improve."

Then there's "Paperclip"—a team of agents with a designated CEO agent that assigns and coordinates work across the group. The outputs Goldie shows include live websites and multi-minute AI avatar videos generated from a single topic prompt. Whether those outputs are good is a separate question from whether the architecture is interesting. The architecture is interesting.

The model-swapping argument

One feature Goldie emphasizes is the ability to swap underlying AI models in and out without rebuilding anything. When a frontier model gets deprecated or outperformed on benchmarks, you pull it and insert the new one. His example: a new model called Sakana dropped the day before the video was recorded, and within an hour it was integrated into the system and producing outputs.

This matters more than it might sound. One of the hidden costs of building workflows around any specific AI product—ChatGPT, Claude, whatever—is lock-in. You optimize your prompts, your integrations, your team's habits, around a particular model's behavior. When that model changes (and they all change, often without warning) your workflows break in unpredictable ways. An abstraction layer that treats models as swappable components rather than fixed dependencies is a genuine engineering insight, not just a convenience feature.

It also reflects something true about the current moment in AI development: no one model is reliably dominant for more than a few months. Building around model-agnosticism is probably correct.

Where the skepticism earns its keep

None of this means the video should be consumed uncritically. A few tensions are worth naming.

First, the performance metrics. Goldie claims this system reduces weekly "AI management time" from 15 hours to roughly 3. That's a specific and significant number—a claimed 80% reduction in overhead. But it's self-reported, unverified, and measured against a baseline ("scattered tools") that he also defines. The comparison is doing a lot of work. Someone whose current workflow is already reasonably organized might see much smaller gains. Someone building this from scratch might find the setup cost—which he describes as hours of daily work over an extended period—exceeds the savings for a long time.

Second, the "non-technical" accessibility claim. Goldie repeatedly says you don't need to be a coder to build or use this. He cites a 61-year-old community member named Ethan who got eight agents running with voice control and Obsidian installed. That's a genuine data point. But the gap between "someone in a paid community with tutorial support can set this up" and "anyone can do this" is wider than the framing suggests. The system he describes involves CLIs, API integrations, local file systems, model configuration, and loop engineering. None of that is insurmountable, but "not technical" is doing some heavy lifting here.

Third—and this is the one worth sitting with—there's a business model embedded in the video. The Agent OS setup, the tutorials, the coaching calls, and the community access all live inside a paid product called the AI Profit Boardroom. The video is, functionally, a demonstration-length sales pitch. That doesn't make the underlying ideas wrong, but it does mean the frame is optimized for persuasion, not falsifiability. You're seeing what works in a controlled demo by the person who built it for their own use case.

Goldie acknowledges some of this: "You know, you could do it the old way which is like everything scattered, different tab for AI you use or different terminal, different window, re-explain your business in every single new chat." The contrast he draws between the old way and the new is vivid and accurate as a description of common pain points. Whether his specific implementation is the right solution, or whether it's one possible solution among several, is a question the video doesn't particularly want to answer.

The architect framing

What's most durable in Goldie's pitch isn't any specific feature—it's the conceptual reframe he offers near the end: "It's about becoming a great architect, right? It's not about prompting, it's not about coding, it's not about being technical. It's about how can you design beautiful systems that work together, that run without you."

This is actually a meaningful shift in how to think about AI capability. The dominant early frame for "using AI well" was prompt engineering—crafting better inputs to get better outputs. The next frame was fine-tuning and retrieval augmentation. What Goldie is pointing at is a third frame: systems design. The value isn't in any individual agent or any individual prompt. It's in the connections between agents, the persistence of memory across sessions, and the feedback loops that improve outputs over time.

Whether you build it yourself, buy someone else's setup, or use one of the commercial platforms slowly assembling similar capabilities, the underlying question is the same: are you still the glue holding your AI tools together, or have you built something that holds itself?


Marcus Chen-Ramirez is a senior technology correspondent for Buzzrag.

From the BuzzRAG Team

AI Moves Fast. We Keep You Current.

Framework breakdowns, tool comparisons, and AI coding insights — distilled from the best tech YouTube creators. Free, weekly.

Weekly digestNo spamUnsubscribe anytime

More Like This

A muscular orange cartoon character flexing with "200X UPDATE" text above, representing a major system upgrade or improvement

Claude Code Channels: AI Coding From Your Phone Now

Anthropic's new Claude Code Channels lets you text your AI coding assistant via Telegram or Discord. Here's what it means for autonomous AI agents.

Zara Chen·3 months ago·6 min read
Bold red text warns "New & Free DeepSeek Is SCARY!" alongside a blue whale logo and red arrow pointing right on white…

DeepSeek V4: Build Apps and AI Agents for Free

DeepSeek V4 lets non-coders build apps and run AI agents for free. Here's what actually works, what breaks, and what the hype leaves out.

Marcus Chen-Ramirez·1 month ago·6 min read
Orange app icon with radiating lines surrounded by gray folder tabs labeled Clients, Business, and YouTube, beside bold…

Browser Use CLI Gives AI Agents Web Control—For Free

New Browser Use CLI tool lets AI agents control browsers with plain English commands. Free, fast, and works with Claude Code—but raises questions about automation.

Dev Kapoor·3 months ago·6 min read
Two AI logos (Google and Claude) flank a glowing red sphere with lightning bolts and the word "SUPERCHARGED" above it,…

Building Claude Skills With NotebookLM: A Technical Autopsy

Julian Goldie shows how to build custom Claude AI skills using Google's NotebookLM. We examine what works, what's overhyped, and what you should know.

Marcus Chen-Ramirez·2 months ago·7 min read
White text reading "loop engineering" centered on a black background with a vibrant purple-to-cyan gradient border

Loop Engineering: Moving Beyond One-Shot AI Prompting

From cron-job automations to multi-day autonomous goals, loop engineering is changing how developers interact with AI. Here's what that actually means.

Marcus Chen-Ramirez·1 week ago·7 min read
A smiling person points at a laptop screen displaying Hermes-Agent code with the text "It's So EASY" overlaid above

Hermes Agent: The Self-Improving AI on Your Own Server

Hermes Agent is an open-source AI assistant that runs on your own infrastructure, learns from your workflows, and automates tasks via Telegram. Here's what it actually does.

Marcus Chen-Ramirez·1 month ago·7 min read
A retro-styled Mac mini displayed alongside a smartphone showing Claude AI interface, with a decorative fan and keyboard on…

A Mac Mini Became an AI Assistant. Sort Of.

A tech YouTuber turned a Mac mini into a dedicated Claude AI workstation. The reality is messier—and more interesting—than the hype suggests.

Marcus Chen-Ramirez·3 months ago·7 min read
Opik Virtual Learning Series promotional thumbnail featuring two presenters (Miles Qi Li, Ph.D. and Abby Morgan) with…

AI Agents Know When They're Breaking the Rules—They Do It Anyway

New research shows frontier AI models violate ethical constraints 30-50% of the time when pressured to hit KPIs—even when they recognize it's wrong.

Marcus Chen-Ramirez·3 months ago·6 min read

RAG·vector embedding

2026-06-24
1,804 tokens1536-dimmodel text-embedding-3-small

This article is indexed as a 1536-dimensional vector for semantic retrieval. Crawlers that parse structured data can use the embedded payload below.