The Daily Johannesburg

Johannesburg news, every day

News

Johannesburg's Digital Archive Crisis: The Hidden Numbers Behind the City's Duplicate Image Problem

From the Joburg City Archives in Braamfontein to private media houses along Bree Street, the proliferation of duplicate digital images is costing organisations millions of rands in wasted storage and labour.

By Johannesburg News Desk · Published 4 July 2026, 9:26 pm

4 min read

Johannesburg's Digital Archive Crisis: The Hidden Numbers Behind the City's Duplicate Image Problem
Photo: Photo by Steward Masweneng on Pexels

Duplicate images are eating Johannesburg's digital budgets alive. Across municipal departments, newsrooms, property platforms and cultural institutions in the city, redundant digital files — identical or near-identical photographs stored multiple times across different servers — have quietly ballooned into a measurable operational crisis that IT managers and archivists are only now beginning to quantify.

The timing matters. With load shedding reductions freeing up more consistent uptime for cloud-syncing workflows since late 2025, Johannesburg-based organisations have been uploading data at a faster rate than at any point in the past decade. Every power-restored sync cycle can push duplicate copies of the same image file onto multiple servers simultaneously, compounding a problem that was already significant before Eskom's stage reductions began cutting into municipal productivity losses.

The Scale of the Problem in Numbers

Industry benchmarks from international data management research consistently place the proportion of duplicate files in large unmanaged digital asset libraries at between 20 and 40 percent of total stored content. For an organisation like the Joburg City Archives, housed in the Civic Centre precinct off Loveday Street in the Johannesburg CBD, or Wits University's Historical Papers Research Archive on the Braamfontein campus, that figure translates directly into rand costs. Cloud and on-premises storage in South Africa is priced in US dollar equivalents, and with the rand trading in the R18-to-R19 range against the dollar through the first half of 2026, every unnecessary gigabyte stored carries a real local currency premium.

A mid-sized Johannesburg property portal — the kind common to Sandton's commercial belt — maintaining a library of, say, 500,000 listing photographs could realistically be carrying 100,000 to 200,000 duplicate or near-duplicate images. At current Amazon Web Services S3-equivalent pricing available to South African businesses, storage costs for that redundant layer can run to tens of thousands of rands monthly, before factoring in bandwidth and retrieval fees. The South African Broadcasting Corporation, headquartered on Radio Park in Auckland Park, and the Independent Media Group, with offices in the inner city, both manage photographic libraries running into the millions of assets — scales at which even a 10 percent duplication rate represents a substantial financial drag.

The labour dimension is just as significant. Digital asset management consultants working with Gauteng-based clients estimate that an archivist or media librarian can spend between 15 and 25 percent of their working week on manual deduplication tasks when no automated system is in place. At a loaded employment cost of roughly R35,000 to R55,000 per month for a senior archivist in Johannesburg's current market, that represents between R5,250 and R13,750 per employee per month in unproductive labour — purely from managing image redundancy.

What Organisations Are Doing About It

Automated deduplication software has existed for years, but uptake among Johannesburg's public institutions has been slow. Budget cycles tied to Gauteng provincial procurement rules, which require multi-quote processes for software purchases above R500,000, have delayed deployments at several city entities. The Joburg Development Agency and the City of Johannesburg's Group Information and Communications Technology Department have both flagged digital asset rationalisation in planning documents as a priority area, though formal programme announcements with confirmed funding have not materialised publicly.

Private sector adoption is moving faster. Several media production companies clustered around Rosebank and Melrose Arch have implemented perceptual hashing tools — software that detects visually identical images even when file names or metadata differ — since 2024, with reported storage cost reductions of between 18 and 32 percent in the first year of deployment. Perceptual hashing works by generating a short numerical fingerprint from an image's visual content rather than its file properties, making it far more effective than simple checksum matching for photographic archives where the same image may have been re-exported at different resolutions.

For smaller organisations — the community newspapers operating out of Soweto's Dobsonville and Meadowlands areas, or the cultural economy enterprises operating under the Johannesburg Tourism Company's broader ecosystem — the practical entry point is simpler. Free or low-cost tools such as open-source deduplication scripts run locally on a desktop can clear backlogs without procurement complications. The most important step, according to data management practice guides, is establishing a single ingest point for all images before they reach storage — a workflow change that costs nothing and prevents the majority of duplicates from accumulating in the first place. Organisations that implement that discipline now, before their libraries grow further, will find the retrospective cleanup task substantially lighter when they eventually get to it.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Johannesburg

This article was produced by the The Daily Johannesburg editorial desk and covers news in Johannesburg. See our editorial standards for how we use AI.

The Daily Johannesburg brief

The day's Johannesburg news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Johannesburg and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Johannesburg news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Johannesburg and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Johannesburg

More in News

Enjoyed this story? Get tomorrow's briefing free.