Why Most Organisations Keep Data Too Long and How to Fix It

Most organisations keep far more personal data than they need. Customer records remain in old systems. Employee files stay in shared drives long after staff leave. Marketing lists grow without review. Project folders are copied,...

S

Sheikh Nasim

Jun 15, 2026

12 min read

Why Most Organisations Keep Data Too Long and How to Fix It

Most organisations keep far more personal data than they need. Customer records remain in old systems. Employee files stay in shared drives long after staff leave. Marketing lists grow without review. Project folders are copied, archived, and forgotten. Backups, email inboxes, spreadsheets, cloud tools, and legacy platforms quietly hold data that no one uses and few people remember.

Under UK General Data Protection Regulation, usually called UK GDPR, this creates a serious compliance problem. The storage limitation principle requires organisations to keep personal data only for as long as necessary for the purposes for which it is processed. In plain terms, if you no longer need the data and have no lawful reason to keep it, you should not keep it.

The problem is that deletion feels risky. Storage feels safe. Many teams would rather keep data “just in case” than make a decision about disposal. But over-retention is not safe. It increases breach severity, raises legal discovery costs, creates unnecessary operational burden, and makes it harder to respond to data subject rights requests.

This guide explains why over-retention happens, what it costs, and how to build data retention schedules GDPR teams can actually use in practice.

Why do most organisations struggle to manage data retention effectively?

Retention problems are rarely caused by one person making a bad decision. They usually come from culture, unclear ownership, weak systems, and the absence of a practical retention schedule.

The “just in case” culture — why deletion feels risky and storage feels safe

Many organisations keep data because someone thinks it might be useful one day. Sales may want old leads. HR may want historic records. Operations may want evidence of past decisions. IT may prefer not to delete anything because deletion can feel irreversible.

This “just in case” mindset is common, but it is not a lawful retention strategy. UK GDPR requires a reasoned decision about necessity. Keeping data because it may possibly be useful at some undefined point in the future is usually too vague.

A practical retention programme gives staff confidence to delete. It replaces personal judgement with agreed rules, documented triggers, and approved disposal processes.

The hidden costs of over-retention: breach severity, litigation, storage, and regulatory exposure

Over-retention increases risk. If a data breach occurs, the organisation may expose years of unnecessary records. The breach becomes larger, more expensive, and more damaging because the data was still there.

It also increases litigation and disclosure costs. If old records are kept, they may need to be searched, reviewed, redacted, and disclosed in legal or regulatory processes. Storage costs may seem low, but the cost of managing old data is not.

Over-retention GDPR breach risk is therefore practical, not theoretical. The less unnecessary personal data you hold, the less there is to lose, search, secure, and explain.

ROT data — Redundant, Obsolete, and Trivial — and how to identify what you genuinely do not need

ROT data means redundant, obsolete, and trivial information. It includes duplicate files, old exports, outdated reports, unused backups, abandoned project folders, draft copies, expired customer lists, and unnecessary attachments.

Operations teams can identify ROT data by asking: Is this still needed for a live purpose? Is it accurate? Is it duplicated elsewhere? Is there a legal reason to keep it? Does anyone own it? Is it covered by a retention period?

If the answer is no, it should be reviewed for deletion, anonymisation, or secure archiving.

What the ICO’s enforcement record on retention failures reveals about common organisational failures

Regulators expect organisations to know what data they hold and why they still hold it. Common retention failures include no documented retention schedule, unclear ownership, old data stored in unmanaged systems, excessive backup retention, and deletion policies that are written but not followed.

A policy alone is not enough. The organisation must be able to show that retention rules are applied in practice.

What does the GDPR storage limitation principle actually require in practice?

The GDPR storage limitation principle requires more than a sentence in a privacy policy. It requires operational control over the data lifecycle.

Article 5(1)(e): “no longer than is necessary” — what this standard demands of an organisation

Article 5(1)(e) says personal data should be kept in a form that permits identification of individuals for no longer than necessary for the purposes for which it is processed.

This does not create one universal retention period. Instead, each organisation must decide what is necessary for each data type and purpose. Payroll records, customer complaints, marketing leads, CCTV footage, clinical records, contracts, and recruitment data may all need different periods.

A good retention schedule explains the period and the reason behind it.

The difference between storage limitation and data minimisation — and why both apply

Data minimisation and storage limitation are related but different. Data minimisation asks whether you need to collect or hold the data at all. Storage limitation asks how long you need to keep it.

For example, a recruitment team may need to collect CVs during a hiring process. Once the process ends, the organisation must decide how long to keep unsuccessful applicant data. It should not keep the data indefinitely simply because it was collected lawfully at the start.

Legal hold obligations: the circumstances in which scheduled deletion must be paused

A legal hold data retention process pauses deletion when records may be needed for litigation, regulatory investigation, audit, complaint handling, or another legal obligation. This is important because automatic deletion can create risk if relevant evidence is destroyed after a dispute has arisen.

A legal hold should be specific, documented, approved, and lifted when it is no longer needed. It should not become a permanent excuse to keep everything.

Statutory retention requirements by sector: HMRC, HR, financial services, healthcare, and legal

Some data must be kept for specific legal or regulatory reasons. Examples may include tax records, employment records, financial services records, healthcare records, insurance documents, and legal matter files.

Data retention periods UK organisations use should therefore be based on a mix of legal requirements, limitation periods, regulatory guidance, business need, and risk. Where no fixed legal period applies, the organisation should make and document a reasonable decision.

How do you build a data retention schedule from scratch?

A retention schedule is a structured document that tells the organisation what data to keep, why, for how long, and what happens when the period ends.

Identifying the data categories your organisation holds across all systems and locations

Start with data mapping. Identify the personal data held across systems, departments, cloud tools, shared drives, email archives, paper files, backups, and third-party platforms.

Common categories include HR data, payroll data, customer records, supplier contacts, marketing data, financial records, contracts, support tickets, website data, CCTV, call recordings, and operational logs.

This step links closely to data lifecycle management operations because you need to understand where data begins, where it moves, and where it ends.

Setting retention periods: the decision criteria and the questions to ask for each data type

For each data category, ask: What purpose does this data serve? Is there a legal or regulatory retention requirement? Is there a limitation period? Is the data needed for complaints, warranties, audits, or disputes? What harm could result from keeping it too long? What harm could result from deleting it too early?

This is the core of how to build a retention schedule. The period should be defensible, not copied blindly from another organisation.

Defining the retention trigger: from creation date, from contract end, from last activity, or from termination

A retention period needs a trigger. Without a trigger, no one knows when the clock starts.

For example, a customer account record might be kept for a defined period after account closure. Recruitment data might be kept from the end of the recruitment process. Contract records might be kept from contract termination. Marketing data might be reviewed after the last meaningful engagement.

Clear triggers make retention rules operational.

Documenting the legal, regulatory, or business driver for each retention period

Every retention period should have a reason. The reason may be statutory, regulatory, contractual, legal limitation, business necessity, audit, or operational need.

Documenting the reason helps the organisation answer questions from regulators, auditors, data subjects, and internal teams. It also prevents arbitrary retention periods becoming permanent.

Retention schedule structure: what a well-built schedule contains column by column

A practical schedule should include data category, processing purpose, system location, business owner, lawful basis, retention period, retention trigger, legal or business justification, disposal action, disposal method, backup treatment, review date, and approval owner.

The structure should be simple enough for operations teams to use. A complex document that no one understands will not be enforced.

How should operations teams implement and enforce a retention schedule in practice?

A retention schedule is only useful if it changes behaviour.

Assigning data ownership: the roles responsible for applying the schedule across business functions

Each data category should have an owner. HR should own HR records. Finance should own finance records. Marketing should own marketing data. IT should support technical deletion. Compliance or the Data Protection Officer should provide oversight.

Without ownership, retention becomes everyone’s responsibility and no one’s responsibility.

Automated deletion versus manual deletion: what technology can support and what it cannot replace

Automated deletion can be powerful. Systems can delete expired records, archive inactive accounts, purge old logs, or apply retention labels. However, automation only works when rules are accurate and systems are configured correctly.

Manual review may still be needed where data is mixed, subject to legal hold, stored in shared drives, or linked to disputes. Technology supports retention, but it does not replace governance.

Cloud and SaaS systems: where your retention schedule meets your vendor contract obligations

Cloud and Software as a Service platforms can create retention challenges. Some vendors offer granular deletion settings. Others retain backups, logs, metadata, or support records according to their own schedules.

Vendor contracts should address retention, deletion, return of data, backup periods, sub-processors, and evidence of disposal. Data deletion policy compliance depends partly on whether vendors can follow your retention requirements.

Backup and archive data: the most frequently overlooked retention compliance challenge

Backup data retention GDPR issues are difficult because backups are designed to preserve data, not delete individual records on demand. Organisations should define backup retention periods, restrict access, prevent routine use of backup data, and ensure backup data is overwritten or deleted in line with documented schedules.

Archives are different from backups. An archive is searchable and often used. If personal data remains accessible in an archive, retention rules still need to apply.

The disposal log: why documenting deletion is as important as carrying it out

Secure data disposal GDPR practice should be documented. A disposal log may record what was deleted, when, by whom, under what authority, from which system, and by what method.

For physical records, secure shredding or certified destruction may be appropriate. For electronic records, deletion should prevent unauthorised recovery where reasonably possible. Evidence matters because it shows the schedule is being applied.

How do you manage the conflict between retention obligations and individual erasure requests?

Retention and erasure can appear to conflict. The right answer depends on the purpose, legal basis, and obligation to keep the data.

The right to erasure versus statutory retention requirements: the decision framework

If an individual asks for deletion, first identify the data, purpose, lawful basis, and retention requirement. If there is no longer a lawful reason to keep it, deletion may be required. If a statutory obligation requires retention, the organisation may need to keep the data despite the request.

The response should be clear and specific. Do not simply say “we keep data for legal reasons” without explaining the relevant category and rationale.

When you must delete despite a retention schedule — and when you cannot, even if asked

A retention schedule sets maximum or expected periods, but it should not be used as an excuse to keep data that is no longer needed. If the purpose has ended and no lawful reason remains, deletion may be appropriate before the scheduled date.

On the other hand, if the organisation must keep certain records for tax, employment, legal, regulatory, or dispute reasons, it may be unable to delete them immediately.

Partial erasure and pseudonymisation: when they are an appropriate alternative to full deletion

Sometimes full deletion is not possible or proportionate. Partial erasure may remove unnecessary fields while retaining required records. Pseudonymisation may reduce identifiability where continued processing is needed.

These approaches should be documented and explained. They are not shortcuts to avoid deletion, but they can reduce privacy risk where full deletion is not appropriate.

Communicating retention decisions to data subjects who submit erasure requests they expect to succeed

People often expect deletion requests to mean all data disappears immediately. A good response should explain what has been deleted, what has been retained, why it has been retained, how long it will be kept, and what rights remain available.

Clear communication reduces complaints and shows accountability.

FAQs

Can an organisation keep personal data indefinitely if it has a legitimate business purpose?

Usually no. A legitimate business purpose must still be specific, necessary, and proportionate. Indefinite retention is difficult to justify unless there is a strong legal or operational reason, and even then the data should be reviewed, secured, and limited.

What is a legal hold and how does it interact with a scheduled deletion process?

A legal hold is an instruction to pause deletion for specific data because it may be needed for litigation, investigation, audit, complaint handling, or another legal obligation. It overrides normal scheduled deletion for the affected records, but only for as long as the hold remains justified.

How should organisations handle personal data held in backup systems and disaster recovery archives?

Organisations should define backup retention periods, limit access to backups, avoid using backup data for live processing, and ensure backups are overwritten or deleted according to schedule. Where individual deletion from backups is technically difficult, organisations should prevent restored backup data from reintroducing erased records into live systems.

Conclusion

Over-retention is not just a technical problem. It is an organisational culture problem made worse by unclear ownership, weak systems, and the absence of a practical retention schedule. Keeping data forever may feel safe, but it increases breach risk, operational burden, legal exposure, and regulatory scrutiny.

Fixing the problem requires clear decision criteria, documented retention periods, assigned owners, deletion workflows, vendor controls, backup rules, disposal logs, and regular review. Most importantly, it requires the discipline to delete data when the time comes.

Stop letting data accumulate beyond its useful and lawful life. Our Data Retention And Deletion Schedules For Operations Teams course gives your operations team a practical retention framework they can implement immediately.

For related learning, explore Data Mapping And Records Of Processing Activities ROPA, Handling Data Subject Access Requests DSAR End To End, and Vendor Risk And Data Processing Agreements For Privacy.

Search