City of Johannesburg officials are sitting on a digital archive problem that is larger, and more expensive, than most residents realise. Across at least four municipal departments — including the Johannesburg Development Agency and the City's Geographic Information Systems unit based in Braamfontein — duplicate image files have accumulated to the point where storage costs are measurable, data retrieval is slower, and planning decisions have in documented cases been based on outdated or repeated visual records.
The core issue is straightforward: when staff upload property photographs, infrastructure inspection images or aerial survey files without automated deduplication checks, identical or near-identical images multiply across shared drives. The problem compounds with every system migration. Johannesburg has undergone at least three major IT platform shifts since 2015, each one capable of copying existing file libraries wholesale into new environments without scrubbing redundant data.
What the Numbers Actually Show
Digital storage is not free, and in a municipality where the 2025/26 budget allocated roughly R73 billion across all services — with IT infrastructure representing a growing line item — wasted capacity matters. Industry benchmarks from storage management consultancies operating in South Africa suggest that unmanaged municipal archives typically carry between 25 and 40 percent redundant data. Apply even the lower figure to a mid-size city department running 50 terabytes of visual records, and you are looking at 12.5 terabytes of avoidable storage consumption.
At current enterprise cloud storage pricing available to South African public entities — roughly R400 to R700 per terabyte per month depending on the provider and contract — that redundancy translates to between R5,000 and R8,750 in unnecessary monthly expenditure per department. Scaled across the City's multiple imaging-heavy units, including those managing the Johannesburg Roads Agency's pothole documentation programme and Pikitup's landfill monitoring photography at the Marie Louise site in Soweto, the cumulative waste is not trivial.
The data integrity angle is arguably more serious than the cost angle. Urban planners working from the City's Spatial Development Framework tools rely on accurate, timestamped visual evidence. When a GIS analyst in the Metropolitan Centre on Loveday Street pulls up a property image to assess rezoning applications in areas like Wynberg or Marlboro, a duplicate file tagged with an incorrect date or metadata can push a recommendation in the wrong direction. This is not hypothetical: duplicate file errors have caused workflow failures in comparable African city administrations, including Nairobi's urban planning directorate, which reported the issue publicly in 2023.
Local Programmes Trying to Close the Gap
The Johannesburg Smart City Initiative, which has been piloted in sections of the inner city around Newtown and along the Empire-Perth corridor, includes a data governance workstream. That workstream explicitly identifies image deduplication as a second-phase objective, though no public completion date has been confirmed. The JDA's own digital asset management review, initiated in late 2024, is understood to be working through roughly 1.2 million stored images across its property portfolio documentation — a figure that illustrates why manual checking is not a viable solution.
Automated deduplication tools — software that uses perceptual hashing to identify visually identical images regardless of filename — are available at licensing costs that range from R15,000 to R120,000 annually for enterprise deployments in the South African market. Several Johannesburg-based tech firms, including companies operating out of the Tshimologong Digital Innovation Precinct in Braamfontein, have built localised versions of these tools calibrated for low-bandwidth environments, a practical necessity given that load shedding schedules still affect server uptime in parts of the metro.
For departments that cannot yet afford licensed solutions, open-source tools such as dupeGuru and digiKam offer functional deduplication at no licensing cost, provided IT staff have the capacity to implement and maintain them. The City's Group IT department has reportedly been evaluating both options since early 2026, though no formal procurement decision has been announced publicly. Ratepayers and civic oversight bodies like the Joburg Accountability Project would be within their rights to ask, at the next portfolio committee sitting, exactly when a decision will be made — and what the delay has already cost.