Decision guide

Real time alerts on existing CCTV: there are five paths and four of them quietly assume you have IP cameras

If you already own a CCTV system and you want it to text someone the moment a person walks into a parking lot at 2 AM, the honest answer is that there are five practical paths to wire that up. Most guides on this topic cover the four that quietly depend on you having the right hardware and skip the fifth one entirely. This is the rest of the picture.

M
Matthew Diakonov
9 min read

Direct answer (verified 2026-05-01)

Five paths exist. Only one works on every legacy CCTV system without further qualification.

  • 1. Built-in DVR or NVR motion alerts. Real time but pixel-based, false positive rate over 95 percent on outdoor scenes.
  • 2. Frigate or similar open-source NVR pulling RTSP from IP cameras. Requires IP cameras with usable RTSP. Breaks on analog coax and on vendor-locked firmwares.
  • 3. Cloud upload to a cloud AI service. Works in principle, breaks in practice on multifamily uplink bandwidth.
  • 4. Full camera replacement (Verkada, Rhombus, etc.). Highest detection quality, 10,000 to 25,000 USD per property.
  • 5. HDMI capture from the recorder's wall-display output. Works on every recorder that drives a wall display, which is every recorder. The path the other guides leave out.
20

At one Class C multifamily property in Fort Worth, Cyrano caught 20 incidents including a break-in attempt in the first month. Customer renewed after 30 days.

Fort Worth, TX property deployment, path 5 (HDMI capture)

The five paths, in detail

Each path has a place where it shines and a property type where it falls apart. Read the path that matches the recorder you already own and skip the rest.

1

Path 1, Built-in DVR or NVR motion alerts

What every recorder ships with. Pixel-difference math against a drawn rectangle, push notifications via the manufacturer's mobile app. No AI. No track. No threat assessment.

Works on: every system that has a working manufacturer app and a stable internet connection.

Breaks on: false positive rate is typically over 95 percent on outdoor scenes. Most teams disable the alerts within two weeks. The recorder is technically alerting in real time. The operator is not getting useful information.

Cost: free. Bandwidth: the manufacturer app pulls the live preview from the recorder over the internet, which on a stretched cable line is the unreliable hop.

2

Path 2, IP camera RTSP into Frigate or a similar open-source NVR

Pull a substream off each IP camera over RTSP, run a local AI detector against each substream, write events to a separate NVR. Notifications go through Home Assistant, MQTT, or a webhook to whatever messaging tool the team uses.

Works on: IP cameras with a documented RTSP path and credentials you control. Frigate, Scrypted, and similar projects all assume this.

Breaks on: analog coax cameras (HD-TVI, HD-CVI, AHD) have no RTSP. IP cameras locked behind a vendor cloud (Lorex, some Reolink and Hikvision firmwares) have RTSP disabled or rotated. ONVIF is supposed to standardize this and frequently does not on cameras older than 2020.

Cost: a separate small server, an Nvidia or Coral accelerator, plus the time to wire each camera and tune zones. Realistic install on a 16 camera property is days, not minutes.

3

Path 3, Cloud upload of every camera stream to a cloud AI service

Push each camera (or each clip) to a cloud video service, run AI in the cloud, send alerts back. Videoloft, Eagle Eye Networks, and most camera vendors' cloud add-ons fit this shape.

Works on: any system that can push video out, which on most modern recorders is supported through the manufacturer's cloud bridge.

Breaks on: uplink bandwidth on a typical multifamily property. A 16-camera continuous upload at 720p is a few megabits per second sustained, which a stretched cable line will not deliver reliably. The latency is the variance, not the average. Internet outage takes the alerts down. Recurring cost typically scales per camera per month.

Privacy: video leaves the building. On a tenant-occupied property, this can be a regulatory issue depending on the jurisdiction.

4

Path 4, Replace cameras with smart cameras that run AI on the camera

Rip out the existing cameras, install Verkada, Rhombus, or a similar enterprise-grade smart camera system that runs AI on the camera itself and pushes alerts to a cloud dashboard.

Works on: any property where the budget exists. Detection quality is the highest of any path because the AI runs against the original full-resolution frame, not a multi-view crop or an uplink-throttled stream.

Breaks on: capital cost. A 25-camera Verkada or Rhombus install is typically 10,000 to 25,000 USD in hardware, plus per-camera annual licensing, plus install labor, plus weeks to months of downtime as cameras and cabling get swapped. On a Class B or C multifamily property the math rarely works.

5

Path 5, HDMI capture from the recorder's wall-display output

Plug an edge AI box into the HDMI port of the existing DVR or NVR. The box reads the multi-view feed the recorder sends to the wall display, splits it back into per-tile streams, and runs detection per tile on a local accelerator.

Works on: every recorder that drives a wall display, which is essentially every install. Analog HD-TVI and HD-CVI recorders, hybrid NVRs, IP NVRs, all of them composite their feeds onto an HDMI output for the management office screen. Capture is downstream of every camera and recorder choice.

Breaks on: detection quality scales with per-tile resolution after the crop. A 16-tile view on a 1080p output gives each camera roughly 480 by 270 pixels of working area, which is enough for person and vehicle detection but limits fine-grained classifications. If the management office uses a 4K output the per-tile resolution doubles.

Install: under 2 minutes for the physical hookup. The unit then needs network and an alert channel (SMS or a chat thread) configured.

Why the HDMI path is the one the other guides skip

Most playbooks online assume your cameras are IP and your recorder is recent. On a fresh build that assumption holds. On a typical Class B or Class C multifamily property the recorder is six to ten years old, the cameras are analog HD-TVI or HD-CVI over coax, and the manufacturer's cloud either doesn't exist or charges per-camera-per-month at portfolio scale. Every IP camera path silently rules out half the install base.

The HDMI path doesn't have that assumption baked in. The recorder already does the hard work of decoding every camera, regardless of brand or wiring, and compositing them into a multi-view for the wall display. An HDMI capture box reads exactly what an operator on the wall display sees. Detection runs per tile on a local accelerator. The same code path works for a 2015 analog DVR and a 2024 hybrid NVR.

The honest tradeoff is per-tile resolution. On a 16-tile multi-view at 1080p, each camera occupies roughly 480 by 270 pixels. That's enough for person and vehicle detection. It's not enough for license plate recognition at 50 feet or for distinguishing a uniformed delivery driver from a generic stranger at the same distance. If you need that, replace the cameras and pay the capital cost. If you need to know that a person is in the parking lot at 2 AM and you need to know within seconds, the HDMI path is the cheapest answer that exists.

What the night shift looks like before and after wiring path 5 onto an existing CCTV

The DVR records 24/7. The wall display shows 16 tiles to an empty management office. Built-in motion alerts get muted on every operator phone within two weeks because they fire on rain, headlights, and the dog from unit 3B walking back from the dumpster. Every incident is reviewed the morning after the resident complains, which usually means scrubbing two to four hours of footage across multiple feeds.

  • Motion alerts disabled by week 2
  • Incidents discovered the next morning
  • 2 to 4 hours of footage review per incident
  • Same recorder, same cameras, same coax

Path 2 (Frigate over RTSP) vs. Path 5 (HDMI capture)

Both paths run AI locally on edge hardware. The difference is what each one assumes about your CCTV before you start.

FeatureFrigate over RTSPHDMI capture (Cyrano)
Works on analog HD-TVI / HD-CVI / AHD coaxNo (no RTSP available)Yes (capture is downstream of the recorder)
Works on vendor-locked IP cameras (RTSP disabled)NoYes
Detection input resolutionCamera substream, typically 640x360 or 1280x720Per-tile crop of the recorder's HDMI output, ~480x270 on a 16-tile 1080p layout
Install effort on a 16-camera propertyDays (per-camera RTSP wiring, zones, tuning)Minutes for the physical hookup, then dashboard zone config
Hardware requiredServer + accelerator (Coral / Nvidia)Single edge box (HDMI input + accelerator)
Where the inference runsOn a server you provision and maintainOn the device, no server to provision
Up-front costServer + accelerator + setup time$450 one-time, $200/mo starting month 2

Frigate is the right answer for a fresh IP-camera build with documented RTSP paths. HDMI capture is the right answer for everything else.

What an honest install looks like

Path 5, end to end

  • Identify the HDMI port on the existing DVR or NVR (every recorder has one)
  • Plug the edge AI box between the recorder and the wall display (HDMI passthrough)
  • Connect the box to the property network on its own subnet or VLAN
  • Open the dashboard, name each tile after the camera it shows (parking lot, pool gate, east entrance)
  • Draw zones for the events that matter (perimeter cross, restricted zone, loitering past 60 seconds)
  • Configure SMS and phone-call alert channels for HIGH THREAT events, daily digest for LOW THREAT
  • Walk the property, trigger each zone, confirm the alert lands on the on-call manager's phone

The dashboard step is where the work actually lives. Drawing zones for the events that matter on a property is property-specific knowledge that doesn't exist out of the box. A leasing-office tile and a pool-gate tile both show people but the rules that should fire are different. Spending an hour walking the property with the zone editor open beats every default config.

Want to see the HDMI capture path on your own recorder before you decide?

A 15 minute call with the founder. Bring the make and model of your DVR or NVR and the rough camera count. We will tell you whether the HDMI path is a fit and whether the per-tile resolution will work for your scenes.

Common questions about real time alerts on existing CCTV

What does "real time" actually mean for a CCTV alert?

Real time means the alert lands on a phone while the operator can still take an action that changes the outcome. That is usually single-digit seconds from the frame being captured to the SMS or chat thread buzzing. If the alert lands after the person has already left the property, that is a faster index into recorded footage, not a real time alert. The cutoff is the action window, not the wall-clock latency in milliseconds.

Why don't the built-in motion alerts on my DVR or NVR count as real time alerts?

They do fire in real time, but they are pixel-difference math against a drawn rectangle, not detections of a person. They alert on raccoons, headlights, rain on the lens, sunset gradients, spider webs in front of an IR illuminator, and the apartment dog walking back from the dumpster. Most teams stop checking the alerts within two weeks because the false positive rate is over 95 percent. The recorder is technically alerting in real time. The operator is not getting useful information.

Can I plug Frigate or a similar open source NVR into my existing CCTV?

Only if your cameras are IP and you can pull a usable RTSP stream off them. Frigate runs detection on substreams pulled from each camera, then writes events to its own NVR. If your cameras are analog HD-TVI, HD-CVI, or AHD over coax (the dominant install for properties built before 2018), there is no RTSP to pull. If your cameras are IP but the manufacturer has locked the RTSP path or rotated credentials behind a vendor cloud, the same problem appears. Frigate is a great fit for a fresh IP-camera build. It is not a fit for arbitrary legacy CCTV.

What is HDMI capture and why does it work where the other approaches don't?

Every DVR and NVR has an HDMI output. That output drives the wall display in the management office: a composited multi-view of every camera the recorder knows about. An HDMI capture box plugs into that output, splits the multi-view back into per-tile streams, and runs detection per tile on a local accelerator. The capture is downstream of every camera and recorder decision, so it does not care whether the cameras are analog coax or IP, whether the recorder supports RTSP, or whether the manufacturer has locked the firmware. If the recorder can drive a wall display, capture works. The tradeoff is detection quality scales with per-tile resolution after the crop, so a 16-tile view on a 1080p output gives each camera roughly 480 by 270 pixels of working area.

How fast is real time on an HDMI capture path versus a cloud upload path?

On HDMI capture with on-prem inference, the perceived latency from intrusion to phone notification is typically 3 to 8 seconds. The hops are HDMI capture (sub-second), per-tile inference on the edge accelerator (100 to 400 ms), event de-duplication and threat classification (sub-second), thumbnail crop (sub-second), push to the messaging provider (1 to 3 seconds), provider fan-out to the device (1 to 3 seconds). On a cloud upload path, an uplink round trip stacks on top: the camera or recorder uploads each frame or each clip to a cloud service, the service runs inference, then the alert flows out. On a typical multifamily DSL or business cable line the uplink is the unstable hop and adds 2 to 30 seconds depending on the upload bandwidth and cloud queueing. The variance is the problem, not the average.

Will any of these paths cut down the volume of false-positive alerts?

AI-based paths (Frigate, camera-side AI, HDMI capture with an edge AI box, cloud AI services) all reduce false positives by replacing pixel motion with track-based person detection, so insects on the lens, headlights, animals, and rain stop generating alerts. Built-in DVR motion alerts do not. The remaining false positives on AI paths are humans doing legitimate things (a vendor at 3 AM, a tenant returning home through the garage at 1 AM). Those are not detection problems, they are property-context problems, and they need rules for known schedules or known vehicles to suppress.

How many camera feeds can one HDMI capture device cover?

Depends on the inference budget and the multi-view layout. Cyrano's current edge box supports up to 25 camera feeds per unit. The recorder's HDMI output is configured to display a 4 by 4 or 5 by 5 grid (depending on how many cameras the property has), the capture box reads that grid, and the on-device accelerator runs per-tile inference at a few frames per second per tile. Per-tile resolution drops as the grid grows: a 5 by 5 grid on 1080p output gives each tile about 384 by 216 pixels, which is enough for person detection but starts to limit fine-grained classifications.

Do I need to replace cameras to make this work?

No. Camera replacement is a separate axis from real time alerts. Replacing cameras gives you camera-side AI, higher resolution per feed, and modern wiring. None of those are required for real time alerts. An HDMI capture path runs on whatever the recorder produces, including 720p analog HD-TVI feeds composited onto a 1080p wall display. If you replace cameras later, the capture path benefits from the higher per-tile resolution but does not require the swap. If you have already paid for the cameras and the coax pulls and the recorder, replacing them just to get alerts is a sunk-cost mistake.

What if my CCTV system uses a vendor cloud that already offers AI alerts?

That is a legitimate sixth path on modern systems and worth considering before adding any other hardware. The honest tradeoffs are: pricing typically scales per camera per month and gets expensive at portfolio scale, the AI runs in the vendor's cloud (so an internet outage takes the alerts down), the alerts use whatever event taxonomy the vendor has chosen (rarely property-specific), and the data leaves the building (which can be a tenant privacy issue depending on the jurisdiction). On most properties built before 2020 the vendor cloud either does not exist or charges enough per camera per month to make a one-time HDMI capture box plus a flat monthly fee cheaper across a 16 to 25 camera install.

🛡️CyranoEdge AI Security for Apartments
© 2026 Cyrano. All rights reserved.

How did this page land for you?

React to reveal totals

Comments ()

Leave a comment to see what others are saying.

Public and anonymous. No signup.