At least three major Nairobi-based institutions — including two county government digital offices and the Kenya ICT Authority's shared cloud repository — launched emergency data-hygiene drives this week after audits revealed that duplicate image files had consumed tens of thousands of gigabytes of redundant storage space, slowing systems and inflating hosting costs at a time when every shilling of public expenditure faces scrutiny under the ongoing IMF fiscal adjustment programme.
The timing is awkward for the Ruto administration. Treasury is under pressure to demonstrate that digitisation investments are delivering efficiency gains, not generating fresh waste. Duplicate images — the same photograph, graphic or scanned document stored two, three, sometimes a dozen times across separate folders or user accounts — are a mundane but expensive problem that has festered inside Kenya's public-sector digital infrastructure since the accelerated cloud migration of 2022 and 2023.
What the Audits Found This Week
The Kenya ICT Authority, headquartered along Waiyaki Way in Westlands, confirmed this week that its latest internal storage audit — covering the shared government cloud environment — flagged duplicate image content as the single largest category of recoverable waste. The authority did not release precise gigabyte figures publicly, but the audit summary, circulated to participating ministries on Tuesday, July 1, described the problem as systemic rather than isolated to one department.
At the Nairobi City County government's e-services division, based off City Hall Way in the central business district, technical staff have been running deduplication scripts since Monday. The county's digital infrastructure supports the Nairobi Integrated Service Management System — a platform used for permit applications, land searches and business licensing — and image-heavy documents like site photographs and scanned title deeds are the primary culprits identified in this week's clean-up operation.
Separately, iHub Nairobi, the long-running tech community space on Ngong Road, hosted a practitioner session on Wednesday evening focused on open-source deduplication tools. Attendees from at least a dozen local startups discussed practical approaches to identifying and replacing duplicate images inside content management pipelines — a problem that is acute for media companies, e-commerce platforms and civic-tech developers building on top of government data.
Why It Matters for Nairobi's Silicon Savannah Ambitions
Kenya's cloud storage costs are not trivial. Commercial providers offering infrastructure to local businesses charge between KSh 8 and KSh 22 per gigabyte per month depending on service tier and redundancy options, according to publicly available pricing listed by local resellers as of mid-2026. For a database carrying hundreds of thousands of duplicate images, the monthly waste can run into hundreds of thousands of shillings — money that Nairobi's cash-strapped county and national agencies demonstrably cannot afford to burn.
The duplication problem has a specific local origin story. When the government's Konza Technopolis authority and various line ministries accelerated their migration to cloud storage between 2022 and 2023, many departments simply uploaded existing local-drive content wholesale, without deduplication checks. Multiple users uploading the same imagery for different projects compounded the problem over two years.
For the private sector along Kilimani Road's growing cluster of media-tech and adtech firms, the lesson is operational. Startups that built image-heavy platforms during the 2023 and 2024 funding boom are now encountering storage bills that did not appear in early projections — and this week's public-sector clean-up has prompted several to run their own internal audits.
The practical path forward is straightforward, if labour-intensive. Organisations are advised to run hash-based deduplication tools — which identify files that are byte-for-byte identical regardless of file name — before replacing duplicate records with a single canonical reference file. For teams operating content management systems, setting upload validation rules that check for existing image hashes before storing a new file prevents duplication at source. The Kenya ICT Authority has indicated it will publish updated data-management guidelines for public-sector entities before the end of July, which is expected to include mandatory deduplication checks as part of cloud storage procurement standards going forward.