Matthew Diakonov, Written with AI

Published April 28, 202611 min read

Smart vs standard, the retrofit path

The line between standard CCTV and smart surveillance is not in the camera. It is in the device behind the camera. That is why a retrofit usually beats a replacement.

Most existing camera fleets are five to ten years old, the cameras are physically fine, and the system is functioning as a recorder. Vendors selling new smart cameras frame the upgrade as a hardware story. It is not. The hardware difference is modest; the intelligence difference is the whole story. This page walks through what makes a system smart, why an edge overlay captures most of that benefit on existing infrastructure, and what survives in the retrofit story when the network goes down.

See a retrofit on a five year old fleet

4.9from 50+ properties

Reads the existing recorder's HDMI feed; cameras stay where they are

Adds person, vehicle, and package detection without per-camera RTSP setup

Per-camera arming windows; per-zone dwell rules

WhatsApp and SMS alerts in seconds; events persisted locally during outages

What 'smart' actually means in surveillance

Vendor marketing has overloaded 'smart' to mean almost anything, but on the ground there are only two functional capabilities that matter. The first is content recognition: can the system look at a frame and tell you what is in it (a person, a vehicle, a package, a face) instead of just storing the pixels. The second is action: when the system recognizes something, can it do something (push an alert, flag a clip, suspend a recording, ping an operator) without a human being in the loop.

A camera that does both is a smart camera. A camera that records pixels and hands them to a human reviewer is a standard camera. The recorder behind it is the same in both cases. The cabling, the power, the lens, the housing, all of that is shared between the two worlds. The intelligence is what is different, and the intelligence does not have to live in the camera body.

That last sentence is the architectural insight that makes retrofitting work. If intelligence can live in a separate device, then the cameras and recorder can be commodity, the intelligence can be a swappable component, and an upgrade does not require pulling cable. The same insight is why modern data centers split storage from compute and why modern phones split LTE radios from app processors. Everything gets cheaper when the smart layer is decoupled from the dumb layer.

The two ways to get smart on an existing fleet

Worth being explicit about the alternative. The retrofit path is not the only path; it is just the cheapest one for the common case where the existing cameras and recorder are functional. The replacement path makes sense in narrow cases and is a bad default everywhere else.

Feature	Replace cameras with smart cameras	Edge AI overlay retrofit (Cyrano style)
Cameras	16 new smart cameras at 400 to 1,200 each	Existing 16 cameras stay in place
Cabling	Audit and patch every drop; sometimes pull new	Untouched
Recorder	Often new NVR plus managed switch	Existing recorder keeps doing forensic record
Where intelligence lives	On each camera (or in the cloud, depending on vendor)	On a single small box near the recorder, reading HDMI
Install effort	Two to four days, partial coverage during cutover	About an hour, no cutover, no coverage gap
Upgrade later	Replace cameras again next time	Swap the edge box, cameras stay
All-in cost on a 16 camera property	12,000 to 25,000 dollars	2,500 to 5,000 dollars plus monthly software

What goes on the wall after the retrofit

One small fanless box, sitting next to the recorder, plugged into the recorder's HDMI port and the office monitor's HDMI loop-out. The recorder believes it is showing the trailer monitor; the overlay quietly captures the same frames and runs them through a detection model. There is no recorder configuration change. The cameras have no idea anything has happened. The intelligence is entirely on the new box.

Retrofit on a five year old fleet

Drop the box
Fanless mini PC on top of the rack, single power cable, single HDMI input from the recorder.
Learn the layout
Auto-detect the active grid (4x4, 3x3, 2x2 single), bind each tile to the recorder's name strip, persist the binding.
Define zones and windows
Per camera, draw the zones that matter (gate, lobby, mailroom, parking) and the arming windows (after hours, weekends).
4
Wire the alert path
WhatsApp number, SMS fallback, optional dispatch escalation. First test alert in the same hour the install started.

What survives the retrofit, and what does not

The retrofit is not lossy in the way some upgrades are. The forensic record is preserved (the existing recorder keeps writing). The optics are preserved (the cameras keep doing what they were already doing). The wiring is preserved (literally untouched). What is added is the intelligence layer and the alert path.

preserved by the retrofit:

Existing camera optics and aim
Existing cabling and PoE switches
Existing recorder and forensic disk array
Existing wall monitor on the loop-out
Existing rack power and UPS

added by the retrofit:

Person, vehicle, and package detection per camera
Per-zone and per-dwell rules
WhatsApp and SMS alerts with thumbnails
Local outbox so events survive uplink outages
Structured event log for incident review

20 / 30 days

“On a 180 unit Class C multifamily property in Fort Worth, the existing camera fleet had been recording untouched for years. Adding an overlay that watched the same DVR HDMI feed surfaced 20 incidents in the first 30 days, including a break-in attempt that the recorder had captured but nobody had reviewed. Same cameras, same recorder. The only difference was the smart layer.”

Cyrano field notes, Class C multifamily retrofit

What the operator actually sees

The dashboard is not the product. The product is the message that arrives in the operator's WhatsApp at 2:14am, with a thumbnail showing the gate, a one line caption (gate-main, 12s dwell, person present), and a link to the clip. The dashboard exists for the audit trail and for the rare incident review; it is not what anyone looks at day to day.

0to 15 seconds, event to alert

0small box per recorder

0+cameras typically handled per box

0DVR or camera config changes

When replacing the cameras is the right move

Worth saying clearly: there are cases where the camera itself should be replaced. If the housing is cracked, the lens is fogged, water has gotten in, the IR ring is half dead, or the sensor is showing banding artifacts at night, no overlay will help. The frame coming out of the recorder is too degraded for detection.

For everything else (the typical 1080p IP camera installed in the last decade) the retrofit is the cheaper, faster, and more durable upgrade. The smart layer is a separate component; when the next model generation ships, the operator swaps the box, not the entire fleet. That is what makes the architecture work across decades of camera generations.

The architectural point

Decouple the smart layer from the dumb layer.

Storage got cheaper when it was decoupled from compute. Phone networks got cheaper when the radio was decoupled from the app processor. Surveillance follows the same pattern. The cameras and the recorder are commodity. The intelligence is the swappable component. A retrofit architecture is the way to live in that world without replacing the commodity every time the intelligence improves.

Make a five year old fleet smart, without replacing it

A 15 minute call. We connect to a real existing recorder, draw zones, and fire a test WhatsApp alert. No cable runs, no cutover.

Smart vs standard surveillance: frequently asked questions

What actually separates 'smart surveillance' from 'standard CCTV'?

Two things. First, whether the system can recognize semantic content in a frame (a person, a vehicle, a package, a face) instead of just storing pixels. Second, whether the system can act on that content (push an alert, pause a recording, throw a flag) without a human being in the loop. Standard CCTV records pixels; smart surveillance recognizes content and acts. The hardware difference is smaller than vendors imply. The intelligence difference is the whole story.

If smart cameras already exist, why retrofit instead of replacing?

Because the average existing fleet is five to ten years old, the optics are still adequate for most detection tasks (1080p covers person and vehicle detection comfortably), the cabling is in place, and the cameras are aimed and tested. The labor to rip out and replace a 16 camera property is in the 12,000 to 25,000 dollar range, plus a coverage gap during the cutover, plus the schedule cost of getting the right ladder rentals during business hours. The retrofit captures the same intelligence gain by adding one box that reads the existing recorder's HDMI output. The cameras stay; only the analytics layer changes.

Why is reading the recorder's HDMI feed a robust retrofit path instead of talking to each camera directly?

Because every recorder has a video output, but not every recorder has a working RTSP path. RTSP varies by manufacturer, often requires per-camera credentials, sometimes sits behind a license tier, and frequently fails when the camera firmware updates and the credentials silently break. HDMI is the universal output: all recorders have it, the trailer or office monitor is already plugged into it, and the overlay box can sit in the loop. Setup is one cable. Configuration is on the overlay only; the recorder is untouched.

What does 'smart' look like once the overlay is running on a standard CCTV fleet?

The cameras still record exactly as before. The recorder still writes to its disk array. What changes is the delivery surface. Instead of a wall monitor that nobody watches, the operator gets WhatsApp messages with a thumbnail when a defined event fires: a person at the gate at 2am, a vehicle in the dock area after hours, a person dwelling near the trash room for 30 seconds. Every alert is one line in a structured event log. The pixels never go to the cloud; only the events and the small thumbnails do.

What is the upgrade math on a 16 camera multifamily property?

Replacing all 16 cameras with smart cameras lands at 12,000 to 25,000 dollars all-in including labor, cable patch, and the new NVR if needed. The retrofit overlay lands in the 2,500 to 5,000 dollar range plus monthly software, with no cabling work. The intelligence gain is roughly the same; the optics gain on the new cameras is often invisible at the actual scene distances. The retrofit collapses the upfront capex by roughly an order of magnitude and ships the same outcome to the operator's WhatsApp inbox.

When does it make sense to replace cameras instead of retrofitting?

When the cameras are physically failing (water ingress, dead sensors, lens fog), when the placement is wrong because the property changed and aim adjustment will not fix it, or when the existing recorder is so old that the HDMI output is composite and the overlay capture quality drops below what is useful. Modern recorders with HDMI 1.4 or higher are fine. Truly ancient analog DVRs sometimes are not. In that case the cheapest move is usually to replace the recorder and keep the cameras, not the other way around.