The three types of data putting law firms at risk

9 July 2018

Posted by Sam Jefferies, Vice President of EMEA at Legal Futures Associate DocsCorp

Jefferies: Don’t wait for a leak to happen

Data seems to be the word on everyone’s lips right now. Phrases like data breaches, data harvesting, and data regulations clog our news feeds until our brains start turning it all into white noise.

But don’t let the overwhelming amount of information out there blind you to what’s important – protecting the data you or your firm holds.

Most data leaks happen accidentally and could be as innocent as a reader uncovering information the sender may never have realised was there.

So, how can you take back control of what you share? The key lies in these three important terms.

Metadata

One of the biggest worries the firms I speak with have is not knowing what’s in a document beyond what they have typed. With every Microsoft Word file comes complex metadata – like total editing time, last modified date, and author names – that can tell the reader much more than what would be printed on a piece of paper.

Metadata isn’t all bad. In fact, a lot of it can help with document management. Metadata like tags, title, and creation date can be searched for in a file system like Microsoft SharePoint, making file discovery easier and more accurate.

However, you should be careful with what metadata you leave attached to documents sent to people outside of the business.

Metadata like total editing time, comments, and anything else never intended to live beyond the draft stage should be wiped, so it doesn’t end up in the wrong hands. Clients and opposing counsel could uncover a goldmine of bonus information in a comments section only meant for internal use.

Hidden data

Hidden data is text covered with a black box instead of being redacted correctly or the font colour being turned white, embedded files, Track Changes, hidden formulas, and hidden columns in an Excel spreadsheet.

Any reader can uncover this text, or ‘unhide’ columns, and suddenly have access to a whole host of information that should have been kept private.

My company was once inadvertently given access to more than we bargained for. We were sent a spreadsheet with delegates registered for an event. But, we noticed the column names in the file weren’t A, B, C, but A, D, G. When we unhid the missing columns, we uncovered job titles, email addresses, and telephone numbers – much more than the event organisers told us they could send.

It was a lesson to us as much as it was to them – always know what you’re sending.

Dark data

Where hidden data risked disclosing more information than intended, dark data is not being able to find the information you need.

The lifecycle of dark data begins with scanned files, email attachments, and bulk file imports added into a file system. From there, they go dark simply because they lack the text layer search technology uses to find them.

Image-based files with no text layer, like scanned IDs or invoices, need to be processed through optical character recognition (OCR) technology to be searchable. OCR technology scans an image file and applies a text layer, so it can be searched for using on-page content like a client name or case number.

Dark data is a serious threat to GDPR compliance since a response to a data subject access request requires an organisation to provide all data relating to the requestor. Failing to provide all the information because the documents were undiscoverable can lead to costly disputes, drawn-out negotiations, and financial penalties.

Don’t wait for a leak to happen before you consider what hidden and dark data you’re working with. When it comes to data management, prevention is always better than cure.

_{Tags:
DocsCorp}

Services Directory Advertise Become an Associate

Market Intelligence for Law Firms of the Future

The three types of data putting law firms at risk

Conferences

Claims Futures Conference 2026

Regulation & Compliance Conference 2026

Related News

How does the legal profession become neuroinclusive?

The real cure for inequality at law firms? Listening

Andy Burnham and the Hillsborough Law: should solicitors be worried?

Defending fair fees in the property profession

Judging proportionate risk requires confidence. Do law firms have it?

Features

Concerned that AI is affecting enquiries? Here’s what to do

Service of proceedings by alternative methods

Faster, leaner, smarter: How AI lets small firms compete with BigLaw

Associate News

The login nobody asked for (and the workaround you already have)

Brabners continues Leeds expansion with two more partner hires

Podcast reveals the business development habits costing law firms’ growth

Hemrick O’Malley PLLC selects iManage to consolidate document and email management across its legal team

Small law firms have raised the bar for client service

Why data will define the AI-ready law firm

Why ‘having a policy’ will no longer be enough

The three types of data putting law firms at risk

Upcoming Webinars

Conferences

Related News

Features

Associate News

Associates

Acquira Professional Services

LexisNexis®InterAction®

VinciWorks

InfoTrack

R&R Solutions

Financial & Legal

Access Legal

Brabners

LexisNexis Enterprise Solutions

Efimis

Miller Insurance Services LLP

Valid8 IP

Kord

Recovery First Limited

Osprey Approach

Somuna

Qanooni

Search Acumen

Seven Stars Legal Funding

OneSearch Direct

National Claims

Clio

Internet Erasure Ltd

CEL Solicitors

LEAP Enterprise

Document Direct

Ignite Specialty Risk

Allianz Legal Protection

OneAdvanced

Express Solicitors

DR Solicitors

Temple Legal Protection

Conscious Solutions

SearchFlow

DG Legal

AxiaFunder

Lockton Companies LLP

Bundledocs

Legal intelligence from LexisNexis®

Fraser and Fraser

National Accident Law

BigHand

Linetime

Perfect Portal

O'Connors

iCOFA

Nexa Law

Legal Brokers

ARAG

Legmark

Verisk

National Accident Helpline

Landmark Information Group

Dye & Durham

Stridon

Sign-up for our e‑newsletter