Duplicate images are cluttering Johannesburg's official digital infrastructure — and the people tasked with fixing it are not all reading from the same page. The City of Johannesburg's e-government directorate, the Gauteng Department of Infrastructure Development, and several heritage bodies are each grappling separately with repositories bloated by repeated, mislabelled or conflicting visual files, a situation that has quietly worsened since the city's 2023 digital-services consolidation drive.
The issue matters now because Joburg is midway through a major overhaul of its metropolitan data systems. The Joburg Connect project — a city-wide effort to unify service-delivery platforms under a single portal — is scheduled to complete its second phase by the end of 2026. Administrators say duplicate image records are slowing database performance, inflating storage costs, and, in some cases, feeding outdated photographs into public-facing systems that residents actually use to report potholes, downed cables and water outages on platforms like the City of Joburg's ServiceJo'burg app.
Archivists, Tech Officers and Heritage Groups at Odds
At the Johannesburg Heritage Foundation, based in Parktown, archivists have been working since early 2025 to digitise and deduplicate a visual catalogue covering over a century of city imagery — from the mining-era Ferreirasdorp streetscapes to the post-apartheid transformation of Newtown. Staff there have described the scale of the duplication problem as significant, with thousands of image files catalogued multiple times under different reference numbers after successive scanning campaigns used inconsistent naming conventions. The foundation has not released a final count publicly, but the project timeline has already been extended once.
In Soweto, the Ubuntu Heritage Trust, which operates out of Vilakazi Street in Orlando West, has been separately compiling a digital record of the township's cultural economy — markets, murals, performance venues — as part of a tourism-promotion initiative backed by the Gauteng provincial government. Trust administrators have acknowledged that photographs supplied by multiple contributing organisations frequently overlap, with the same storefronts or street corners captured by different photographers and submitted under different metadata. Without a deduplication protocol agreed across contributors, the trust's database risks presenting inflated asset counts to potential investors and tour operators.
Technology specialists working in the Sandton central business district, where several data-management consultancies are based along West Street and Rivonia Road, say the city's problem is not unique — but that Johannesburg's fragmented governance structure makes it harder to solve. The ANC-DA coalition running Gauteng has created a situation where city and provincial mandates over digital infrastructure sometimes overlap without clear authority, according to analysts familiar with the provincial technology budget. Resolving duplication at scale typically requires a single point of accountability, and that accountability is currently contested.
What a Fix Would Actually Require
Deduplication at municipal scale is not a trivial undertaking. Industry practitioners cite perceptual hashing — a technique that identifies visually identical or near-identical images regardless of filename or metadata — as the standard approach for large public-sector repositories. Several South African technology firms, including companies registered with the Technology Innovation Agency, have proposed pilot projects to the City of Joburg, though no procurement announcement has been made public as of July 2026.
The financial stakes are real. Cloud storage costs for the City of Joburg's digital operations are factored into the metro's annual ICT budget, which the 2025-26 Joburg budget tabled in May 2025 placed at roughly R1.2 billion across all departments — a figure that includes licensing, infrastructure and data management. Analysts note that redundant image files, while individually small, aggregate quickly across departments and can add measurably to storage overhead over a multi-year period.
For residents and businesses interacting with city systems, the practical upshot is simpler: service portals that draw on clean, deduplicated visual data respond faster and surface more accurate information. City officials have said the Joburg Connect phase-two rollout will include a data-quality audit. Whether that audit will formally address image duplication — or treat it as a secondary concern behind more urgent service-delivery data — is a question that administrators, heritage workers and technology consultants in this city are still debating.