THE INDEPENDENT MAGAZINE FOR LEGAL PROFESSIONALS
Feature Archives
Offsite Links
Announcements

 

 

<< return to front page

 


Digital Document Discovery: The process of Document Discovery has become a well worn tool in the litigator’s bag of tricks

Rule 31.4 of the Civil Procedure Rules defines a document as “Anything” in which information of any description is recorded”. If we apply this rule to the digital domain, the potential reach offered to the litigator can be substantial. When you consider that by 2007 it is estimated some 35 billion Emails will be sent every day and, on average, around 92% of all the information created during 2002 was held on computers (while a mere 0.01% of the total was held on paper) the quantity of available data is staggering. With so much information, how do we go about separating the digital wheat from the digital chaff? More to the point, how do we work out where to find the wheat in the first place?

The process of Document Discovery has become a well worn tool in the litigator’s bag of tricks. It is a popular belief that if someone has, thought or said something they shouldn’t, somewhere there was likely to be a piece of paper with it written down. Even though we continue to march toward the long held ideal of the paperless office, we still find row upon row of filing cabinets stuffed to the gunnels with potentially incriminating evidence. However, with the current reliance on technology; computers, PDAs, mobile phones and, sometimes, even printers can offer up far more potentially valuable bounty. Digital Document Discovery is, quite simply, the discovery of documents that were created electronically and/or stored digitally.

What You See, Isn’t Always What You Get

The increasing awareness of the value of digital evidence in both civil and criminal matters brings with it many new challenges, not least of which is the concept of what digital evidence actually is. In its most fundamental form, digital evidence is exactly what it says on the tin, digital! It is simply a stream of ones and zeros. But, by using software tools and a little knowledge we can turn this data into information. Quite simply, what we see when we open a document in software such as Microsoft Word is an interpretation of the data that forms the document. This information can be found in word processing documents, spreadsheets, emails and database files to name but a few.

However, the information we see when we use the appropriate software is often just the tip of the iceberg. The deeper value of digital evidence can often be found in the information that is normally hidden from the viewer. For example, a Word document may contain details such as the name of the original author, the owner of the software used to create the document and even previous versions of the document while an Excel spreadsheet might contain hidden cells and formulae. These are all examples of the valuable evidence hidden within the data that is known as metadata, or “information about the information”. It is only possible to recover this metadata, which might be just the “smoking gun” we are looking for, electronically.

As a result of the way we work with and store this digital information, the next challenge that faces us is that of possible spoliation. Digital evidence brings with it a unique potential for spoliation. Simply opening a Word document and printing it can alter document content such as dates, or metadata such as user details and information about the last printer to be used. Other potentially vital information such as the last date the file was accessed or modified can also be changed. Because of this, the discovery and disclosure process must be carefully controlled and monitored if we are to maintain the evidential integrity of the documents in our case.

Tell Me When To Stop

The sheer quantity of available information can also create difficulties for us. The extent to which the discovery and disclosure of material is increased through the inclusion of digital evidence can be both a blessing and a curse. In a recent case of IPR theft, DataSec recovered more than 6000 relevant documents from 330 gigabytes of data seized during the execution of an Anton Pillar order. The search, equivalent to a review of more than 630,000 documents was completed in only 4½ days. Whilst the quantity of recovered documents and the time in which the search was completed would put most paper discoveries in the shade, the client’s lawyers still had significant quantity of documents to assess.

As a result of the ease with which data can be distributed between computers, multiple copies of a document may exist throughout an organisation. Distribution can occur through everything from the humble floppy disk to the use of writeable CDs to the sending of files as attachments to Email. Such proliferation needs to be identified and filtered out unless, of course, it is pertinent to the action we are pursuing!

Proliferation also highlights another problem. Where do we look for the documents? Most modern offices, from the smallest SOHO through SMEs and on to the big Corporations now employ some kind of network with which to connect their computers. These networks may hold documents centrally on dedicated computers known as Servers. These computers may be located in other countries and consequently other jurisdictions. Even the Internet can be a potential store for documents relevant to our case. Space can be rented on computers anywhere in the World on which documents can be stored and accessed via the Internet and Email services, such as HotMail, are Internet based (HotMail’s servers are located in the USA).

Another digital benefit is the ability to recover documents that have been deleted. Because of the “way computers work”, if a document has been stored or manipulated on a computer, even though the User may have deleted it, fragments or even complete copies can be routinely recovered. Kenneth Withers, in his article “Is Digital Different? Electronic Disclosure and Discovery in Civil Litigation” , describes this as the “Vampire Effect”. In other words, no matter how hard you try to kill it, it will always return.

 

 

Work Smart

For the Digital Document Discovery process to produce manageable results the scope of the discovery needs to be carefully defined. Part 31 of the Civil Procedure Rules attempts to direct the definition of this scope through the “Rules of Disclosure and Inspection”. For example, Rule 31.6 requires that a party disclose only the documents on which he relies or that may “adversely affect his own case”, while Rule 31.12 provides for “specific disclosure or specific inspection” which allows searches according to a defined extent. However, as always rules are subject to interpretation and can fall under the heading of “Legal Argument”. So what can we do to better control the process?

Perhaps the most obvious approach is to make the technology do the hard work for us. As we are in a digital environment we can make use of the power of computers. We can search thousands of documents for words or phrases that are relevant to our case in a matter of minutes. We can create “digital fingerprints” of specific documents we may be looking for. We can investigate the extensive “audit trail” that is created whenever a computer event, such as the creation or opening of a document occurs (remember, these audit trails are themselves potentially “documents” under rule 31.4).

The engine that drives this technology is intelligence. In much the same way that we need a sound briefing to develop our knowledge of what we should be looking for in a paper discovery, the creation of a solid intelligence picture can help identify where we should be looking, the “keywords” we should be searching for, the documents we should be “fingerprinting” etc. From this we can start to develop an appropriate strategy for discovery and disclosure.

Where Do We Go From Here?

Evidence that we discover from digital sources has the potential to remain digital throughout the litigation process. Indeed, some “documents” lose so much relevant detail if printed that they become incoherent and unusable. As a result, this type of data can only be effectively presented in an electronic format. A draft Practice Direction produced by the Society for Computers and Law provides advice on how to identify and deal with such issues and reach agreement on their resolution. There are obvious benefits to be gained by keeping digital documents in an electronic environment. The entire results of a Document Disclosure can be stored on one or more CDROMs instead of dozens of lever-arch files. The savings in paper and ink alone can be quite substantial (not to mention environmentally friendly!).

The power of digital examination and presentation is not only restricted to documents that have always existed in electronic format. The benefits can be extended to paper evidence. Paper documents can be scanned to create electronic versions that can be indexed and catalogued to allow searches for words and phrases. The ordeal of preservation and access can be greatly simplified bringing the same benefits that we see with purely digital evidence with obvious efficiency and cost savings.

The traditional paper discovery and disclosure process will always be with us but, the benefits to had from Digital Document Discovery are clear and unambiguous. The digital age is not just coming, it is here with us now. Meet the challenges that it brings with open arms and reap the benefits because the future isn’t just bright, it’s digital!

 

Civil Procedure Rules
http://www.dca.gov.uk/civil/procrules_fin

 

International Data Corporation,
“Worldwide Email Usage Forecast, 2003-2007”

http://www.idc.com

 

University of California,
“How Much Information 2003”

http://www.sims.berkeley.edu/research
/projects/how-much-info-2003

 

Ken Withers 1999,
“Is Digital Different?
Electronic Disclosure and Discovery
inCivilLitigation”
http://www.kenwithers.com/articles/bileta/

 

Society for Computers and Law,
“Draft Practice Directions for the
use of IT in Civil Proceedings”

http://www.scl.org

 



   
Search WWW Search The Barrister