The Daily Johannesburg

Johannesburg news, every day

News

Johannesburg's Hidden Digital Waste Problem: The Numbers Behind Duplicate Images Clogging City Systems

From Sandton's corporate towers to Soweto's community health clinics, redundant image files are costing Johannesburg institutions real money — and the data tells a stark story.

By Johannesburg News Desk · Published 4 July 2026, 8:51 pm

3 min read

Johannesburg's Hidden Digital Waste Problem: The Numbers Behind Duplicate Images Clogging City Systems
Photo: Photo by Derek Keats on Pexels

Johannesburg's public and private sector organisations are sitting on mountains of duplicate digital image files, and the cost of storing them is measurable. A conservative industry benchmark, cited by data management firms operating in South Africa, puts the proportion of duplicate files across enterprise storage systems at between 20 and 30 percent of total stored data — a figure that translates, for a mid-sized city department, into tens of thousands of rands in unnecessary cloud and server costs every year.

The timing matters. Joburg's city administration has been under pressure to cut operational expenditure since the 2024-25 municipal budget cycle, when the City of Johannesburg Metropolitan Municipality flagged a multi-billion-rand infrastructure funding shortfall. Simultaneously, the ANC-DA coalition governing Gauteng has made digital efficiency a stated policy priority, creating an environment where IT departments across provincial and municipal government are being asked to demonstrate leaner operations. Duplicate image data — unglamorous but persistent — sits squarely in that crosshairs.

Where the Problem Lives in Joburg

The issue is not unique to government. On Rivonia Road in Sandton, financial services firms managing large customer-facing document archives — insurance policy scans, KYC photographs, mortgage application images — have been quietly grappling with storage bloat for years. A single client onboarding process at a major bank can generate four to six copies of the same identity document image across different internal systems, according to general data management guidance published by technology consultancies operating in the region.

At the other end of the city, clinics in Soweto administered through the Gauteng Department of Health store patient photograph records and radiology scan images across fragmented systems. The Chris Hani Baragwanath Academic Hospital, Africa's largest hospital by bed count, processes thousands of patient files monthly. When images are uploaded, archived, and then re-uploaded after system migrations or staff error, the duplication compounds rapidly. Storage costs follow.

Joburg's Metrorail reform program — which has involved extensive digitisation of infrastructure inspection reports and route photography since 2023 — has created its own version of this problem. Inspection images captured on the East Rand and Soweto lines are routinely duplicated when field teams synchronise mobile uploads with central servers, according to general IT governance frameworks applied to infrastructure digitisation projects of this type.

What the Numbers Actually Mean

Storage is not free. Amazon Web Services' S3 standard storage tier, widely used by South African enterprises, currently costs approximately R0.42 per gigabyte per month at prevailing rand exchange rates. A government department storing 10 terabytes of image data — of which 25 percent is duplicate — wastes roughly 2.5 terabytes, or about R1,050 per month, on files that provide zero additional value. Scaled across dozens of departments within the City of Johannesburg alone, that figure becomes material.

Deduplication software — tools that scan storage systems and identify bitwise identical or near-identical image files — has existed for over a decade. Open-source options such as dupeGuru are free to deploy. Commercial enterprise solutions from vendors including Veritas and Commvault start at licence costs that South African resellers have quoted publicly at between R8,000 and R45,000 annually for mid-market deployments, depending on data volume. The return on investment calculation is straightforward at scale.

The data compression opportunity is recognised internationally. The European Commission's 2023 digital public sector guidelines specifically cite image deduplication as a priority efficiency measure for government IT. South Africa has no equivalent national mandate yet — but with the Department of Communications and Digital Technologies advancing its national cloud policy framework through 2026, the window for cities like Johannesburg to get ahead of a likely future standard is open.

For organisations in Johannesburg looking to act now, the practical path is a storage audit first. The Joburg-based tech hub at 22 Diagonal Street in the CBD hosts several data management startups offering audit services. The audit establishes a baseline — how many gigabytes, how many duplicate image clusters — before any deduplication tool is deployed. Without that baseline number, the savings remain invisible. And in a city where every rand of operational budget is contested, invisible savings are savings left on the table.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Johannesburg

This article was produced by the The Daily Johannesburg editorial desk and covers news in Johannesburg. See our editorial standards for how we use AI.

The Daily Johannesburg brief

The day's Johannesburg news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Johannesburg and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Johannesburg news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Johannesburg and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Johannesburg

More in News

Enjoyed this story? Get tomorrow's briefing free.