
The Next Abstraction Layer in AI: From Model Design to Model Generation

Simon Bennett

A Watchtower Brief

For the past decade, the AI conversation has centered on models.

Which architecture wins?
Which framework scales?
Which training strategy produces the best accuracy?

But something subtle is beginning to change. The frontier is shifting from building models to generating them automatically. And that shift may become one of the most important abstraction changes in AI infrastructure.


The Pattern Should Feel Familiar

If you’ve spent time in semiconductors, the pattern is almost obvious. Early chip design required engineers to work directly at the transistor level. Then came gate-level schematics. Then RTL and logic synthesis. Then high-level synthesis. Each layer didn’t eliminate the need for engineering skill. It raised the level of abstraction, enabling engineers to design more complex systems.

AI development is now entering a similar phase. Until recently, building machine learning systems required a loop of manual steps (condensed into a code sketch after this list):

• selecting architectures
• tuning hyperparameters
• training and retraining models
• compressing and quantizing networks
• validating performance on target hardware
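
Condensed into code, that manual loop looks roughly like the sketch below. It assumes PyTorch; the architecture, hyperparameter grid, and synthetic data are illustrative stand-ins, not a recommended recipe.

```python
# A compressed version of the manual workflow: pick an architecture,
# sweep hyperparameters, train, quantize, and compare results by hand.
import itertools
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.ao.quantization import quantize_dynamic

def make_model(width: int) -> nn.Module:
    # "Selecting an architecture" reduced to a single width knob.
    return nn.Sequential(nn.Flatten(), nn.Linear(784, width),
                         nn.ReLU(), nn.Linear(width, 10))

# Synthetic stand-in data so the sketch is self-contained.
x, y = torch.randn(256, 1, 28, 28), torch.randint(0, 10, (256,))

results = []
for width, lr in itertools.product([64, 128], [1e-3, 1e-2]):  # hyperparameter grid
    model = make_model(width)
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(20):                                       # train (and retrain)
        opt.zero_grad()
        loss = F.cross_entropy(model(x), y)
        loss.backward()
        opt.step()
    # "Compressing and quantizing": dynamic int8 quantization of the Linear layers.
    qmodel = quantize_dynamic(model, {nn.Linear}, dtype=torch.qint8)
    acc = (qmodel(x).argmax(dim=1) == y).float().mean().item()
    results.append((acc, width, lr))

print(max(results))  # best (accuracy, width, lr) found by brute force
```

Every step in that loop is a judgment call made by a human, and none of it yet accounts for the device the model has to run on.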

For edge and embedded systems, the process becomes even harder. Now add constraints like:

• memory footprint
• power budgets
• latency requirements
• heterogeneous silicon platforms

At that point, model development becomes less about theory and more about engineering tradeoffs. And those tradeoffs multiply quickly.
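
One way to see why: each hardware limit acts as a hard filter on the design space, and a candidate has to pass every filter at once. A minimal sketch, with made-up numbers for a hypothetical edge target:

```python
from dataclasses import dataclass

@dataclass
class DeviceBudget:
    # Illustrative limits for a hypothetical edge target.
    flash_kb: float     # memory footprint ceiling
    power_mw: float     # sustained power budget
    latency_ms: float   # per-inference deadline

@dataclass
class CandidateModel:
    name: str
    size_kb: float
    est_power_mw: float
    est_latency_ms: float
    accuracy: float

def feasible(m: CandidateModel, b: DeviceBudget) -> bool:
    # A candidate only counts if it fits all budgets simultaneously.
    return (m.size_kb <= b.flash_kb
            and m.est_power_mw <= b.power_mw
            and m.est_latency_ms <= b.latency_ms)

budget = DeviceBudget(flash_kb=512, power_mw=150, latency_ms=20)
candidates = [
    CandidateModel("accurate-but-big", 2048, 400, 35, 0.97),
    CandidateModel("fits-the-budget", 400, 120, 12, 0.91),
]
survivors = [m for m in candidates if feasible(m, budget)]
print(max(survivors, key=lambda m: m.accuracy).name)  # "fits-the-budget"
```

The most accurate candidate is often the first one eliminated, which is why the model that ships is rarely the model that topped the leaderboard.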


The Edge AI Bottleneck

In hyperscale cloud environments, you can afford inefficiency. If a model is too large, you scale infrastructure. But at the edge — inside cameras, sensors, industrial systems, and consumer devices — the constraints are unforgiving. Every milliwatt matters. Every megabyte matters. Every millisecond matters. Which means deploying AI at the edge often becomes an exercise in manual optimization and iteration.

Data scientists build the model. Embedded engineers try to make it run on hardware. Then everyone starts iterating. In many organizations, this process still takes months. Sometimes longer.


When Engineering Loops Become the Problem

This is where the next abstraction layer begins to appear. Instead of engineers manually exploring thousands of possible model configurations, a new category of platforms is emerging that automates the exploration. Think of it as:

AI systems that design AI models.

These systems can automatically:

• generate candidate architectures
• train models
• test them against real hardware constraints
• measure performance
• iterate until an optimal solution emerges

Instead of:

Data → Model → Deployment

The workflow becomes closer to:

Data → Automated Model Generation → Deployment

For teams building embedded AI systems, that difference can dramatically compress development timelines.
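
At its core, a generate-train-test-iterate loop can be framed as constrained search. The sketch below uses random search with stand-in training and profiling functions; in a real system, candidates would be trained on actual data and profiled on the target device rather than estimated.

```python
import random

random.seed(0)

def profile_on_hardware(width: int, depth: int) -> dict:
    # Stand-in for on-device profiling; a real system measures these
    # numbers by running the candidate on the actual target.
    return {"latency_ms": 0.4 * width * depth / 64,
            "size_kb": 1.5 * width * depth}

def train_and_score(width: int, depth: int) -> float:
    # Stand-in for real training; accuracy loosely rewards capacity.
    return min(0.99, 0.70 + 0.02 * depth + 0.0005 * width)

LATENCY_BUDGET_MS, SIZE_BUDGET_KB = 25.0, 512.0

best = None
for _ in range(200):                                   # generate candidates
    width = random.choice([32, 64, 128, 256])
    depth = random.randint(2, 10)
    stats = profile_on_hardware(width, depth)          # test against hardware
    if (stats["latency_ms"] > LATENCY_BUDGET_MS
            or stats["size_kb"] > SIZE_BUDGET_KB):
        continue                                       # infeasible: discard early
    acc = train_and_score(width, depth)                # train and measure
    if best is None or acc > best[0]:
        best = (acc, width, depth, stats)              # iterate toward the best
print(best)
```

Random search is the crudest possible strategy; the point is only that once constraints and scoring are explicit, the exploration itself no longer needs a human in the loop.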


The Hardware Reality

There’s another reason this shift matters. AI models don’t run in a vacuum. They run on silicon. And as AI moves deeper into embedded systems, robotics, and industrial infrastructure, model design becomes inseparable from hardware constraints.

The best architecture in theory may perform poorly on a specific device. The most accurate network may exceed memory limits. The fastest model may consume too much power. Which means optimization can’t happen only inside the training loop. It has to happen against the hardware itself.

This is where platforms like ModelCAT are beginning to gain attention.


AI That Builds AI

ModelCAT approaches the problem differently. Instead of asking engineers to manually design and tune models, the platform focuses on automated model generation constrained by real hardware environments. The workflow is simple on the surface:

Upload your dataset. Define your target device and constraints. The system then:

• generates candidate models
• trains and evaluates them
• tests performance across hardware targets
• iterates until it finds an optimal solution

The result is a model that is not only accurate, but hardware-aware from the start. For embedded AI teams, this changes the development equation. Instead of optimizing models after they are built, optimization becomes part of the generation process itself.
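
From the user’s side, that abstraction could collapse to a single call. To be clear, the sketch below is not ModelCAT’s actual API; every name in it is invented purely to illustrate the shape of the contract: dataset and constraints in, deployable model out.

```python
# Hypothetical client-side view of a model-generation platform.
# None of these names are real ModelCAT APIs.
from dataclasses import dataclass

@dataclass
class GenerationJob:
    dataset_path: str       # illustrative field names throughout
    target_device: str
    max_flash_kb: int
    max_latency_ms: float

def generate_model(job: GenerationJob) -> str:
    # Placeholder: a real platform would run the search loop sketched
    # earlier (generate, train, profile on hardware, iterate) server-side.
    return f"model_for_{job.target_device}.tflite"

artifact = generate_model(GenerationJob(
    dataset_path="data/defects.zip",   # hypothetical dataset
    target_device="cortex-m7",         # hypothetical target
    max_flash_kb=512,
    max_latency_ms=20.0,
))
print(artifact)  # a hardware-aware model, ready to deploy
```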


A Familiar Evolution

Seen through the lens of semiconductor history, this transition is not surprising. The complexity of modern systems eventually forces new abstraction layers. We saw it in:

• logic synthesis
• high-level design
• system-level architecture tools

AI engineering is now hitting the same complexity wall. And the response looks similar:

automation and abstraction.

ModelCAT is one of the companies exploring what that layer might look like.


Why This Matters Now

The timing is not accidental. AI is moving rapidly from cloud infrastructure into physical systems:

• robotics
• industrial automation
• automotive
• edge sensing
• intelligent devices

Every one of those environments introduces constraints that traditional ML workflows struggle to handle efficiently. Which means the companies that solve AI deployment complexity — not just model accuracy — may end up shaping the next generation of AI infrastructure.


The Watchtower View

At The Watchtower, we spend a lot of time looking for these inflection points. Moments where complexity forces a structural change in how technology is built. Edge AI may be approaching one of those moments now. As the number of intelligent devices continues to grow, the industry may no longer be able to rely on manual model design. Eventually the system will need to build itself. And that is exactly the direction platforms like ModelCAT are exploring.


Subscribe to The Watchtower Brief for more signals from the front lines of AI infrastructure, semiconductor design, and deep-tech go-to-market.