heading · body

Transcript

The Ai Bandwidth Wall And Co Packaged Optics

read summary →

TITLE: The AI Bandwidth Wall & Co-Packaged Optics CHANNEL: Asianometry DATE: 2025-08-10 ---TRANSCRIPT---

Over the past two decades, the semiconductor industry has successfully pushed the limits to deliver more raw compute. A wild success, really. A 2025 Nvidia B200 GPU has 178,000 times more Floating
Point Operations Per Second, or FLOPs than a top-of-the-line Intel CPU in the late 1990s. Unfortunately, we don’t fully benefit thanks to limits - or “walls”,
so to say - that keep us from taking full advantage of all that new raw power. One such wall is the memory wall

  • where semiconductor memories
    like DRAM have not scaled as fast as the compute has. I have covered this before. But in recent years, another bottleneck has presented itself: One relating to IO bandwidth.
    In response, the industry has been working on a silicon photonic solution. In today’s video, we talk about a technology trend
    that has been over a decade in the making: Co-packaged optics.

Beginnings

When it comes to data transfers out of a rack, there is an equation with two parts - silicon switch chips and the optics IO. A switch chip directs data coming in and out of the server,
forwarding them at high speed. It’s the heart of something like an Ethernet switch. Think of the switch chip as like an airport’s terminal - with tens
of thousands of passengers entering and leaving en route to their destination. Then we have the optics IO, which is where the data actually enters and exits. Today, the data inside the server rack travels as electric signals through thin
copper wire. But signals sent over a distance greater than 2 meters are better sent using
fiber. Fiber can carry more data over further distances with fewer signal integrity issues. The transition between electric and light is managed using a silicon photonics product
called a pluggable optical transceiver. We plug them into the switchgear’s faceplate
on each server rack, and they translate between digital optical and digital electrical signals. Inside a transceiver, there is a laser transmitter and modulator to encode and
send outgoing light signals, as well as a photodiode receiver to receive and interpret
incoming light signals. These together are referred to as the “optical engine”. The transceiver will also have components handling the electrical signals coming in from the server’s
switch chips. These are collectively referred to as the “electrical engine”. So all in all, if the switch silicon is like the airport terminal, then you can
think of the optical IO as like that terminal’s airport gates and roads. If the terminal is
too small - or if its roads or gates are too easily congested - then we get traffic jams.

Light and Silicon

Now switch chips are made from CMOS silicon, and so benefit from Moore’s Law. In recent years, switch chips have massively scaled up their
capacity as foundries like TSMC have shipped ever smaller nodes. But optical IO transceivers - despite having gone from 10 gigabits per second
20 years ago to 800 gigabits per second today - have grown nowhere near as fast.
Creating an obvious bottleneck because you can only plug so many into a switchgear. The industry is working on next-generation transceivers capable of doing 1.6 terabits
per second. Those are rolling out right now. After that,
the industry is going to spend a few years trying to earn back their investment. Meanwhile, switch chips in 2023 have capacities of 51.2 terabits per second.
And Broadcom is already seeding a switch chip with capacity of 102.4 terabits per second. And that number will very likely double again in another two years. The hard
truth of the matter is that Moore’s Law - even today - goes far faster than optics.

Faster and Faster

Why do we have this underperformance by the optical IO side? One of the bigger reasons is the Serdes. What is that? The data coming from those switches
is sent over in parallel - multiple data streams sent at lower speeds. But data can only be sent through the optical fiber serially. So we
need something to turn those parallel data streams into
a serial one. That something is called the serializer/deserializer, or Serdes. Imagine multiple streams of people approaching an
airport gate from different directions. The serializer is like the gate agent. They are arranging all these passengers into an orderly line that can get checked in through the
gate, that corridor bridge thing, and finally into their seat on the plane. The deserializer does the same thing but in the opposite direction. It
receives a serial signal that it converts into many parallel ones. At 800 gigabits per second, the electrical signals going
to the transceivers through the copper wires start experiencing serious integrity issues. So the Serdes needs to compensate by either using more sophisticated algorithms to pack
more bits or boosting weak signals via equalization. Both cost power, thus making the Serdes one of the
fastest growing consumers of power in the whole building. In 2013-14, Huawei estimated that in a 28 nanometer switch, the Serdes took up about 10-15%
of the switch’s area and power. Ten years later, Serdes now take up over twice that proportion. Nvidia’s VP of hyperscale Ian Buck told IEEE Spectrum in a 2025 interview that
pluggable optics now consume 10% of a GPU’s total compute power. Spread across thousands of these Serdes links, all that power consumed turns into
a real problem. Data center operators must work within power budgets pre-defined by
their fixed infrastructure. Power spent on networking cannot be used for GPUs.

Co-Packaged Optics

The general response has been to bring the electric and optical signals closer together. The Holy Grail is a small, manufacturable silicon photonics laser integrated right
into the chip itself to get data out as fast as possible. But this dream remains a dream. Silicon cannot easily generate or modulate laser
light. So the alternative has been to put the optic modules
like the transceiver as close as possible to the switch silicon. With the goal of bringing it inside the package itself - making it “Co-packaged” so to use the
term. As with anything relating to packaging, Co-Packaged Optics is a broad term that can mean
anything. There is a lot less standardization than in the mainline silicon industry. Co-packaged optics generally speaking means to package the switch silicon and optical modules
like the transceiver components together in the same package and on the same substrate. Again, the key idea is to shift away from these pluggable transceivers and
bring the photonic transition as close to the silicon as possible.
The less copper wiring all that data has to travel through, the better. Other benefits is that moving optical IO off the faceplate grants us more physical space. There are also some chances to eliminate redundant parts and reduce costs.

CPO Challenges

As with anything related to advanced packaging, co-packaged optics is no cakewalk. The single biggest concern is heat. Thermal issues in advanced packaging
has been a serious concern since the days of multi-chip modules. As we all know,
heat can warp or damage the packaging materials. But with optics, heat has an additional negative component. Light travels around
a photonic circuit through these thin channels called “waveguides”. Heat shifts the waveguides’
resonant wavelengths, causing them to stop working correctly at the laser’s
wavelength. So heat dissipation becomes doubly important here. And integrating the light source itself brings up crucial questions. Where
will the light source be? Off or on-chip? Having the light source be off-chip lowers
thermal risk and makes the whole thing more repairable, but introduces coupling issues. Coupling refers to the fact that the waveguides are very thin,
perhaps a few hundred nanometers across. But light arrives to the chip via optical
fibers that are about 10 micrometers wide

  • so maybe ten times larger or thicker. This is an obvious mismatch, and is like channeling water from a firehose into a
    bundle of straws. The delicate work of onboarding and offboarding that light is called “coupling”.
    Achieving the most precise alignment of the fibers is hard, and errors lead to light losses.

OBO and NBO

The technical challenges are formidable,
but can be overcome. The thing that’s harder to overcome is inertia. Generally speaking, the industry is conservative. The faceplate pluggable
transceiver has been used for years. They are cheap. They are easy to replace. And
there are well-established standards, giving you a broad range of vendors to pick from. Getting customers to adopt something so different from before is always a
pain in the butt. So like as with 3D stacking packaging,
the path to Co-Packaged Optics adoption has been long and winding. In 2018, an Microsoft-led industry consortium introduced On-Board Optics or Board-Mounted
Optics solutions. As the name implies, On-Board Optics remove the pluggable
transceivers and bring the optical engine directly onto the PCB with the switch chips. The data coming out of the switch goes down into the board and then through that to the
optical engine. Since it’s a shorter distance, there is better latency and some power savings. Then in 2020, companies started introducing the idea of “Near-Package Optics”. Here we
insert a high speed silicon interposer in between the packaged chip and optical engine. This is a middle step towards full Co-Packaged Optics. Kind of like how ASE and the Taiwanese
packaging industry put forth 2.5D integration as a middle step towards full 3D stacked logic.

TSMC and COUPE

In 2021, TSMC presented a co-packaged optics technology,
called COmpact Universal Photonics Engine or COUPE. COUPE 3D-stacks an electrical Integrated Circuit or IC on top of another “photonic IC”. Laser light from an outside diode is guided into that photonic IC where
it can be modulated before being sent out to the optical fiber. TSMC argues certain significant benefits with this. First,
it lets you make the electrical IC with an advanced node while making the photonic IC
with something less advanced like 65 nanometers. This saves on cost. Second, stacking the electrical IC on top of the photonic IC shortens the length of
the wires connecting them. This reduces parasitic capacitance and signal loss,
issues that might arise if the two ICs are sitting side-by-side on an interposer. TSMC claimed that COUPE improved the photonic engine’s power consumption
by 40% and raised its speed by 170% compared to other ways of connecting. So,
not a bad way to finally make that leap towards co-packaged optics. The issue again, however, was that nobody had a
reason to take that leap. Judging by the number of people who wrote
about COUPE when it was first announced - basically none - the same issues applied. TSMC’s 2022 annual report mentioned in a single,
brief paragraph that a few companies taped out COUPE chips in 2022 as a test. Same thing with its 2023 annual report.
TSMC mentioned that the number of test chips made with COUPE achieved expectations. Which,
again, makes me think that the customer reception was pretty mid.

AI Blows Up

That changed when ChatGPT and generative AI became the hottest thing since Mala and boba. Which you should not mix together. And as Nvidia began its march to becoming the world’s most valuable company. In 2023 and 2024, tech giants rushed to build billion-dollar AI factories
across the world. With those factories consuming unprecedented amounts of power,
vendors suddenly had a real economic technical reason to finally adopt Co-Packaged Optics. TSMC rushed to build out COUPE’s roadmap and tie it into the rest of their advanced packaging
ecosystem. At their 2024 North American Technology Symposium, they discussed a 3-stage roadmap that
first brings COUPE to pluggable transceivers. So a familiar, evolutionary step forward to start. Next, TSMC integrates COUPE into their flagship advanced packaging platform,
CoWoS. This lets vendors make electrical-photonic COUPE
chiplets and put them right next to switch silicon. Or even later on, CPU/GPU silicon. This dual-approach might sound wishy-washy on the surface,
but I reckon it lets companies dip their toe into this complicated Co-Packaged
optics solution without feeling like they are crossing some bridge of no return. Nvidia has been COUPE’s tip of the spear. At GTC 2025, Nvidia announced
two new switches - the Quantum-X InfiniBand platform and Spectrum-X Ethernet platform. While they have been in the switch industry since 2023,
these are their first switch products with co-packaged optics. Nvidia says that replacing pluggables with co-packaged optics grants their
equipment 3.5 times better power efficiency,
10 times better network resilience, and 1.3 times faster deployment.

Broadcom, GF, and the Startups

Of course, Nvidia and TSMC are not the only ones working on these type solutions for AI. Broadcom has their own sizable AI chip business, including making
semi-customized AI ASIC hardware for hyperscalers like the Google TPU chip. They are a technical pioneer,
having first brought Co-Packaged Optics to market with the 51.2 terabit Tomahawk 5. And as I mentioned earlier, they are relentlessly
pushing the market forward with faster and faster switch chips. And it has long been known that TSMC definitely has ground to make up in the
silicon photonics space behind other foundries like GlobalFoundries and Tower Semiconductor. The former especially is quite strong. Their GF Fotonix platform is now in its
third generation with a good PDK, manufacturing scale, and a strong
cluster of silicon photonic customers like Ayar Labs, Ranovus and Lightmatter. Ayar Labs in particular seems to be already doing what COUPE is planning to do over the next few
years. They have received investment money from both AMD and Intel (imagine that boardroom). They recently announced a new “optical interconnect chiplet”. With up to 8
terabits per second bandwidth and compatibility with the Universal
Chiplet Interconnect Express standard, it sounds like a pretty compelling product.

Conclusion

Co-Packaged Optics is another step forward in bringing light and electrons ever closer together. But does that mean we have a situation ahead where light does a complete takeover? As in a
pure photonic AI chip? The thought occasionally pokes its head in the news from time to time. Again, the issue as I see it involves both technology and economic inertia. Economically,
we have been doing silicon and electrons for 70 years now, and it is hard to move away from that. And technically. Many of the current heat and coupling issues get worse with
an all-photonic chip. Add to that a lack of scalable on-chip light
sources and issues relating to achieving certain logics with all-optical gates. And then, the electrical IC folks can also stay very competitive by adopting strong evolutionary
improvements like replacing some copper interconnects with Ruthenium for instance. So that all-photonic dream remains for now a dream. But the long-awaited rise of Co-Packaged
Optics is a marvelous step, and a real sign of how AI is remaking the world’s silicon supply chains.