Market InsightsJudd Walks #2010 min readMay 7, 2026

AI Just Past the Cute Phase: GPT-5.5 and the Workflow Shift

Judd Hoffman
Judd Hoffman

CEO, Ethica AI

Did you hear it?

Two weeks ago, OpenAI released GPT-5.5 and a new version of Codex, and that release was the loudest warning shot AI has fired into the workforce yet. Most people I know in business have not heard it. They will. Soon.

Here is why this release matters, and why it is different from every prior release.

Has AI moved past the cute phase?

Yes. AI has moved past the cute phase because the interaction is no longer limited to asking a question and reading an answer. The new workflow is giving AI a messy task and expecting it to plan, use tools, check the work, and carry the task through.

The cute phase was the version of AI most professionals have been comfortable with for two years. You ask a question. AI gives you an answer. You read the answer. You decide what to do with it. The interaction was contained. The work was still mostly yours. The AI was a smarter search engine wearing a chat interface.

That phase is over.

GPT-5.5 and Codex are not search engines. They are not chatbots. They are tools that take a messy, multi-part instruction and run it through to completion. They write the code. They check the code. They run the code. They debug the code. They notice when something is missing and fill the gap. They move across tools until the task is finished. The interaction is no longer "ask and receive." The interaction is now "give them the work and let them deliver the outcome."

OpenAI's own framing of the release is direct. The company says GPT-5.5 excels at writing and debugging code, researching online, analyzing data, creating documents and spreadsheets, operating software, and moving across tools until a task is finished. Their language is even more pointed when describing how to use it. Instead of carefully managing every step, you can give GPT-5.5 a messy, multi-part task and trust it to plan, use tools, check its work, navigate through ambiguity, and keep going.

Read that sentence again. Plan. Use tools. Check its work. Navigate through ambiguity. Keep going.

Those five capabilities, taken together, are not a chatbot. Those are the operational behaviors of a knowledge worker. Not all knowledge workers, but a meaningful subset of the routine, process-driven work that has historically required a junior or mid-level professional to execute.

Why does GPT-5.5 change the workflow?

GPT-5.5 changes the workflow because it moves AI from passive response into active execution. The previous generation made existing workers faster; this generation can complete structured work end to end when the goal is clear enough.

This is what I mean when I say AI is now changing the workflow. The previous generation of AI made existing workers faster. This generation does the work itself, end to end, in the categories where the work is structured enough to be specified clearly. That is a fundamentally different kind of leverage, and it is going to ripple through every industry that depends on routine knowledge work.

The benchmarks back it up. OpenAI reports that GPT-5.5 scored 82.7 percent on Terminal-Bench 2.0, a coding benchmark that measures whether an AI can complete real software engineering tasks in a real terminal environment. On FrontierMath, the hardest publicly available math benchmark, GPT-5.5 scored 51.7 percent on the first three problem tiers and 35.4 percent on the fourth tier. Six months ago, the best models in the world were scoring in the single digits on FrontierMath tier four. The capability curve is not slowing down.

OpenAI President Greg Brockman framed the release in plain language during the press briefing, and CNBC's launch coverage also captured the release context. He said the model can look at an unclear problem and figure out what needs to happen next. Note the phrase agentic computing. That is the part of the release that should land hardest on anyone running a business. AI is no longer waiting for instructions. It is acting on its own judgment about what needs to happen next.

That capability changes the math on every workflow that involves routine decision-making between steps.

Which workflows are now exposed to agentic AI?

Any workflow that involves planning, using tools, checking work, and continuing through routine steps is now exposed to agentic AI. That includes document drafting, data reconciliation, compliance review, request triage, spreadsheet work, coordination across systems, and software development.

Think about what that means for the day-to-day operations of any business. If an AI can plan, use tools, check its own work, and keep going, then any workflow that consists of planning, using tools, checking work, and continuing is now a workflow that AI can execute at meaningful quality. The list of workflows that fit this description is long. Drafting and revising documents. Pulling and reconciling data. Reviewing transactions for compliance. Triaging incoming requests. Building and updating spreadsheets. Coordinating across systems. Generating and refining code.

This is happening right now. Not in the future. Right now.

The companies that are seeing this clearly are already moving. They are not waiting for an industry analyst report. They are not waiting for a vendor to package up a perfect solution. They are pulling these tools into their workflows, testing them on real work, and figuring out which parts of their operation are now compressible by an order of magnitude.

The companies that are not seeing this clearly are going to be in a very different position by the end of this year. Not because AI replaces them. Because their competitors who started using these tools six months ago are going to be operating at a structurally different level of speed and capability, and the gap will be visible in the work, in the margins, and in the customer experience.

So if you see companies and products using AI right now, my advice is direct. Try it. Use it. Test it. Now is the time. Do not wait for the perfect tool. Do not wait for the use case to be obvious. Do not wait for someone to write a manual. Open the tools that exist this week. Run them on a real task. See what happens. Then run them on the next task. The professionals who do this every week are going to be operating at a level the professionals who do not cannot match.

What does this mean for real estate?

For real estate, this is not abstract. Agentic AI can already help with transaction files, counter-offer drafts, closing-statement reconciliation, follow-up emails, and comparable-sales organization because those workflows depend on documents, decisions, and repeated operational steps.

I want to bring this back to real estate, because that is the industry I am operating in every day.

The tools that just shipped are directly applicable. They will read a transaction file and surface the missing documents. They will draft a counter-offer based on the contract terms. They will reconcile the closing statement against the loan estimate and flag the variances. They will write the email follow-up to the buyer's agent that recaps the call and lays out the next three steps. They will pull the comparable sales for the listing presentation, organize them by relevance, and explain why each one matters. None of that is theoretical. All of it is doable today with the tools that just shipped.

The compounding piece matters even more than the individual tasks. An agent who runs all of those workflows through AI is not getting a 10 percent productivity boost. They are getting a multiplier across the entire transaction lifecycle. The agent who used to handle 20 transactions a year can now handle 40 or 50 with the same hours invested. The agent who used to spend their evenings catching up on paperwork can now spend their evenings with their family. The agent who used to hand off the boring parts of the work to an assistant can now handle those parts personally and either keep the assistant focused on higher-value work or operate with a smaller team while serving more clients.

The agents who run their week with these tools are going to look fundamentally different from the agents who do not. Not in a small way. In a large way. The work output per hour for an AI-equipped agent is materially higher than for a non-equipped agent in the same market, and the difference is visible to clients, brokers, and competitors. As the tools get more capable, that gap will widen. The agents who started experimenting six months ago are already feeling the compounding effect. The agents who are still on the sidelines are watching the gap open in front of them whether or not they recognize it.

The same is true for brokerages, title companies, lenders, and every other category of business that runs on documents and decisions and routine workflow.

What is the warning shot for business leaders?

The warning shot is that AI has shifted from answering questions to delivering outcomes. Leaders who still treat AI as something to revisit later are allowing earlier adopters to compound workflow capability faster than late starters can comfortably catch up.

Let me restate the warning shot, because the framing is important.

This is not the AI that you ask a question and get an answer. This is the AI that takes the work and delivers the outcome. The difference is enormous. The implications are operational, not theoretical. The window for treating AI as something you will get to later is rapidly closing, because the people who started earlier are now compounding capability faster than the late starters can close the gap.

I am not telling you to panic. I am telling you to pay attention. The tools that exist right now, the ones that just shipped, are not the same tools that existed six months ago. The capability curve has bent again. The professionals who are tracking the curve are adjusting their workflows in real time. The professionals who are not tracking the curve are going to be surprised by where the industry is by the end of the year.

Try it. Use it. Test it now. Now is the time. Let's do this together.

Judd Hoffman is CEO and Co-Founder of Ethica AI, building AI-powered tools for real estate transaction workflows.

Sources

  1. OpenAI: Introducing GPT-5.5: April 23, 2026 GPT-5.5 launch, Codex rollout, agentic workflow framing, capabilities, availability, Terminal-Bench 2.0, and FrontierMath benchmark results
  2. Tech Yahoo / CNET: ChatGPT 5.5 Is All About Math, Science and AI Research: Greg Brockman briefing quote about GPT-5.5 handling unclear problems and agentic computing at scale
  3. CNBC: OpenAI announces latest artificial intelligence model: Package-supplied source URL for launch reporting; browser access was blocked during local verification, so OpenAI and accessible syndicated reporting carry the verified claims

Quick Takes

Who is Judd Hoffman?

Judd Hoffman is CEO and Co-Founder of Ethica AI, a company building AI-powered voice tools for real estate transaction workflows, backed by the California Association of REALTORS. He has nearly three decades of operating experience, including more than 15 years across real estate title, transactions, and technology.

What is Ethica AI?

Ethica AI is a real estate technology company building VoicePilot, an AI-powered tool that allows real estate agents to complete transaction forms by speaking naturally instead of filling out PDFs manually. VoicePilot is backed by the California Association of REALTORS as a free member benefit for more than 190,000 members.

When was GPT-5.5 released?

OpenAI released GPT-5.5 on April 23, 2026, alongside availability in Codex, OpenAI's coding tool. OpenAI said GPT-5.5 was rolling out to ChatGPT and Codex for Plus, Pro, Business, and Enterprise users, with API access coming soon. OpenAI describes GPT-5.5 as its smartest and most intuitive model to date.

What is Codex?

Codex is OpenAI's coding tool and assistant. It was upgraded alongside the GPT-5.5 release in April 2026 and is designed to support writing, debugging, testing, and executing code in real terminal and codebase environments. Codex supports agentic coding workflows where the model plans, uses tools, checks its own work, and continues across multi-step tasks until completion.

What does it mean that AI is agentic?

Agentic AI refers to AI systems that can plan a sequence of actions, use tools, check their own work, and continue without step-by-step human direction. According to OpenAI, GPT-5.5 represents a step forward in getting work done on a computer because users can give it messy, multi-part tasks and let it plan, use tools, check its work, navigate ambiguity, and keep going. This contrasts with earlier AI usage that mostly answered questions.

How does GPT-5.5 perform on benchmarks?

According to OpenAI, GPT-5.5 scored 82.7 percent on Terminal-Bench 2.0, a coding benchmark that measures complex command-line workflows requiring planning, iteration, and tool coordination. OpenAI also reported FrontierMath scores of 51.7 percent on tiers 1 through 3 and 35.4 percent on tier 4. Those results point to stronger performance on agentic coding and advanced reasoning tasks.

How can a real estate agent use these tools today?

Agentic AI tools can read a transaction file and surface missing documents, draft counter-offers based on contract terms, reconcile closing statements against loan estimates, write follow-up emails to other agents, and pull comparable sales for listing presentations. Real estate agents can use these tools today on real workflows, and the compounding effect of consistent use creates significant capacity gains across the transaction lifecycle.

Full Transcript

Well, do you hear it? It was the loudest warning shot from ChatGPT 5.5 and Codex. So basically, AI has moved past the cute asking a question and giving you an answer. Now it is writing the code. It is checking the code, it is filling the gap and it's changing the workflow. So anybody out there, if you see companies and you see products using AI, try it, use it, test it, now's the time. Let's do this.

Judd Hoffman

Judd Walks

A video series from Ethica AI CEO Judd Hoffman. New episodes drop on LinkedIn.