Kenya's public sector digital repositories are carrying hundreds of thousands of duplicate image files — redundant scans, re-uploaded photographs, and copied assets that bloat storage costs, slow retrieval systems, and make a quiet mess of records management that officials are now scrambling to clean up. The problem did not arrive overnight.
The stakes are sharpest right now because the Ruto administration's ongoing fiscal squeeze, tied to IMF programme conditionalities, has forced agencies to rationalise IT spending. Duplicate data is not a trivial annoyance when cloud storage fees are billed in dollars against a shilling that has faced sustained pressure. Every wasted gigabyte carries a real cost on the national budget line.
How the Mess Was Made
The roots go back to at least 2013, when a wave of e-government initiatives pushed ministries and county governments to digitise physical records rapidly. The drive was well-intentioned — paper files in damp Nairobi basements were vulnerable to the same flooding that routinely damages buildings in Eastlands and along the Ngong Road corridor — but the execution lacked uniform standards. Different departments used different scanning software, different naming conventions, and different upload protocols. Files were duplicated when staff transferred them between shared drives on Harambee Avenue government offices and newer cloud-based systems without deletion policies in place.
Nairobi County's own records infrastructure compounded the problem. Between 2018 and 2022, at least three separate digitisation contracts were awarded for the city's land registry and urban planning files — programmes that overlapped in scope and often processed the same source documents. The County Spatial Data Infrastructure initiative, run partly through partnerships with the University of Nairobi's Department of Geospatial and Space Technology along Kikuyu Road, flagged the duplication risk in internal reviews, but those reviews did not translate into a coordinated fix.
Silicon Savannah-era optimism played a role too. Between 2015 and 2020, as Nairobi positioned itself as a continental tech hub — anchored by the iHub on Ngong Road and a cluster of fintech firms in Westlands — both government and private civic-tech partners built dashboards, portals, and open-data platforms that pulled imagery from the same upstream sources. Each new portal cached its own copies. None of them talked to each other about what they already held.
The Numbers and the Reckoning
A 2024 audit by the Kenya ICT Authority, covering selected national government systems, identified duplicate content as one of the top three contributors to unnecessary cloud expenditure — though the authority has not published a definitive figure for the total storage cost attributed to duplication alone. Independent estimates from local IT procurement circles, cited in technology trade publication CIO East Africa, have put redundant data across county and national systems in the tens of terabytes range, with associated annual storage fees running into millions of shillings.
The Gen Z-era tax revolt of 2024 and the political pressure it generated gave fiscal waste a sharper public profile. When MPs debated the Finance Bill that year, scrutiny of government IT spending — long treated as a technical back-office matter — moved briefly into mainstream debate. That scrutiny has not entirely faded. Parliamentary committees reviewing the 2025/26 budget cycle asked ICT Authority representatives pointed questions about value for money in cloud contracts, according to publicly available Hansard records from the National Assembly.
Practical solutions exist and some agencies are already deploying them. Duplicate image detection tools — using perceptual hashing and machine-learning-based deduplication — are now available at relatively low cost through open-source platforms, and several Kenyan startups based in Kilimani and the Nairobi Garage co-working space on Ngong Road have built localised versions tailored to Swahili-language metadata. The Kenya National Archives and Documentation Service has begun piloting a master asset register framework that would give agencies a single reference point before uploading new image files, reducing the chance of future duplication at source.
For county governments and civil society organisations still running unaudited image libraries, the immediate practical step is a deduplication audit before any further cloud migration. Waiting for a national mandate will mean paying for redundant storage through at least one more budget cycle — and in the current fiscal environment, that is an expense few departments can afford to ignore.