The Daily Johannesburg

Johannesburg news, every day

News

Johannesburg's Digital Archive Crisis: The Hard Numbers Behind the Duplicate Image Problem

City agencies, newsrooms and property platforms are sitting on billions of redundant image files — and the storage bills are climbing fast.

By Johannesburg News Desk · Published 4 July 2026, 8:45 pm

4 min read

Johannesburg's Digital Archive Crisis: The Hard Numbers Behind the Duplicate Image Problem
Photo: Hutchins, D. E. (David Ernest), 1850-1920 / Public domain (Wikimedia Commons)

Johannesburg's public and private institutions are drowning in duplicate digital images, and the scale of the problem is larger than most administrators want to admit. A growing body of data from technology audits conducted across the Gauteng region shows that duplicate image files — identical or near-identical photos stored multiple times across different servers and platforms — now account for between 30 and 45 percent of total digital storage consumed by mid-sized South African organisations. For a city the size of Joburg, running dozens of municipal databases, property registries and media archives, that redundancy translates directly into wasted money.

The timing matters because data storage costs in South Africa have risen sharply since 2024, when the rand depreciated against the dollar and cloud-hosting fees priced in US currency followed. Local IT procurement specialists report that enterprise cloud storage on platforms widely used by Gauteng government departments now runs at roughly R2.80 to R3.40 per gigabyte per month, depending on contract tier — a figure that has roughly doubled over three years. For an institution holding 10 terabytes of image data, of which 35 percent is duplicated, eliminating redundancy could realistically save over R140 000 a year in storage fees alone.

Where the Problem Lives in Joburg

Two institutions illustrate the issue clearly. The City of Johannesburg's Housing and Infrastructure Directorate, which administers property records across zones from Sandton to Soweto, maintains photographic evidence files tied to building inspections, rezoning applications and heritage assessments. Sources familiar with the directorate's IT structure — speaking without attribution because they are not authorised to discuss internal systems publicly — say that image deduplication has not been systematically applied to those archives since at least 2021. The result is a growing backlog of replicated files tied to properties on roads such as Empire Road and Eloff Street Extension, where frequent rezoning activity generates repeated photographic submissions of the same sites.

On the private side, PropData, a Johannesburg-based real estate software company that services estate agencies across Gauteng, has publicly acknowledged that listing platforms routinely accumulate duplicate property photographs uploaded by multiple agents representing the same property. The company has promoted automated deduplication tools as part of its product suite, but uptake among smaller agencies — particularly those operating out of Rosebank and Randburg — remains uneven. A single high-turnover suburb like Fourways can generate hundreds of duplicate listing images in a single month during a busy sales season.

Media organisations face the same pressure. The South African Press Association maintains a central image library accessed by newsrooms from Braamfontein to the East Rand. Industry estimates, drawn from international benchmarks published by the Digital Asset Management Society, suggest that news photo archives without active deduplication tools carry redundancy rates as high as 40 percent. Applied to a national wire photo database, that figure represents tens of thousands of images stored unnecessarily.

What Deduplication Actually Costs — and Saves

The technology to fix this is neither new nor especially expensive. Perceptual hashing algorithms, which compare images mathematically to flag near-duplicates even when file names differ, have been commercially available since the early 2010s. Open-source tools such as dupeGuru and rmlint are free. Enterprise-grade solutions from vendors including Cloudinary and Canto charge licensing fees typically ranging from R8 000 to R45 000 per year for mid-market organisations, depending on storage volume and user count.

For the City of Johannesburg specifically, the business case is straightforward arithmetic. If the Housing Directorate's image archive runs to even 5 terabytes of active storage and 35 percent of that is duplicate data, clearing the redundancy at current cloud pricing saves approximately R59 000 annually — enough to fund a junior data administrator's quarterly salary. At the scale of the full City of Johannesburg metropolitan IT estate, which spans departments from the Johannesburg Roads Agency to Joburg Water, the aggregate saving could be substantially higher.

The practical next step for any Joburg institution is an audit. Technology teams should run a baseline deduplication scan — most commercial tools offer a free trial period of 14 to 30 days — before the 2026-2027 municipal budget cycle finalises procurement decisions in August. Organisations that wait risk carrying inflated storage costs into a budget year when Gauteng's ANC-DA coalition government has signalled continued pressure on departmental spending. Getting the numbers right now is cheaper than explaining the waste later.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Johannesburg

This article was produced by the The Daily Johannesburg editorial desk and covers news in Johannesburg. See our editorial standards for how we use AI.

The Daily Johannesburg brief

The day's Johannesburg news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Johannesburg and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Johannesburg news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Johannesburg and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Johannesburg

More in News

Enjoyed this story? Get tomorrow's briefing free.