Posts Tagged ‘information governance’

Docker And The Rise of Open Source

Docker And The Rise of Open Source

There’s never been a phenomenon like Docker . Eighteen months ago, the company took its core technology, which enables IT people to move software easily between different machines by enclosing it in “containers”, and made it open source.


Rolling Intelligence: Enterprise-Level Paying It Forward

Rolling Intelligence: Enterprise-Level Paying It Forward

Rolling intelligence is the enterprise-level equivalent of pay it forward. Effort invested in one business unit, functional area, or type of document storage to visually classify documents benefits the other units, areas, or types of storage that are processed later on. It is the gift that keeps on giving. The reason is that there is a heavy overlap on the types of documents that are used or stored in different areas of an enterprise.


Best Practices: Advanced eDiscovery Search

Best Practices: Advanced eDiscovery Search

As the size of document collections continue to explode, finding the evidence needle in the ESI haystack is more challenging than ever. Understanding the latest tools, indexing techniques, and features associated with advanced eDiscovery keyword search will allow you to conduct a highly effective review while still delivering your production on time.


Are You Paying Too Much for eDiscovery Processing?

Are You Paying Too Much for eDiscovery Processing?

“Free eDiscovery Processing” sounds too good to be true. Until now, you may have spent many hundreds of dollars per GB to process native documents like Outlook Email and Microsoft Office files into paginated reviewable formats like TIFF or PDF. So how can those charges simply disappear? The revolutionary answer lies with Lexbe’s secure, scalable cloud technology.


How Much Can I Save With CAL? A Closer Look at the Grossman/Cormack Research Results

How Much Can I Save With CAL? A Closer Look at the Grossman/Cormack Research Results

Perhaps the most important conclusion of the study was that an advanced TAR 2.0 protocol, continuous active learning (CAL), proved to be far more effective than the two standard TAR 1.0 protocols used by most of the early products on the market today—simple passive learning (SPL) and simple active learning (SAL).


Cross-Border E-Discovery: Challenges and Lessons to be Learned

Cross-Border E-Discovery: Challenges and Lessons to be Learned

In a recent blog post on cross-border e-discovery, Sasha L. Hefler and Chris Dale discuss the differences between discovery in the United States and abroad, and the resulting challenges. As they point out, discovery in the United States is much more broad than other common law countries, which put limits on discovery in terms of scope—requiring proportionality—and use of personally identifiable information and other private data. While such differences in approach pose challenges in terms of cross-border discovery, these differences may also hold lessons to be learned for those looking to achieve more reasonable and proportionate discovery here in the United States.


Top 10 Mistakes to Avoid When Creating Productions

Top 10 Mistakes to Avoid When Creating Productions

Whether producing in Native, TIFF, PDF, or a blended format, discovery productions are fraught with potential challenges and obstacles that can foil your ability to meet deadlines and satisfy discovery requirements. Here we present 10 of the most common mistakes litigation professionals can encounter, as well as best practices to avoid them.


Big Data: 25 Amazing Need-to-Know Facts

Big Data: 25 Amazing Need-to-Know Facts

By Bernard Marr We surely see a lot of hype surrounding big data but I believe the following 25 facts speak for themselves and help to paint a realistic picture of the phenomenon we now call ‘Big Data’ – a phenomenon that is changing the world as we know it. Every 2 days we create […]


What To Do When Your Big Data Isn’t Big Enough

What To Do When Your Big Data Isn't Big Enough

Storage of Big Data–and particularly unstructured data such as video, PPT presentations, and Word documents that do not fit into a database–has been the subject of considerable discussion. While proper, secure storage was a natural evolution in the Big Data debate lifecycle, it is now merely a precondition to much higher orders of analysis and business intelligence. In this respect, the Big Data remains too small in many minds.


Redefining High Speed eDiscovery Processing & Production

Redefining High Speed eDiscovery Processing & Production

91% of firms engaged in litigation continue to use TIFFs as a favored review format. On the one hand, TIFFs are a standardized, reviewable format that litigation professionals are comfortable with. On the other, TIFF processing requires tremendous computing resources and, as a result, has traditionally been slow and expensive. As the size of collections continue to explode, there is additional pressure to process more native documents into TIFF – and fast.


Got Equivio? A Quick Reference to Highlighted Relationships

Got Equivio? A Quick Reference to Highlighted Relationships

Founded in 2004, Equivio has become one of the world’s leading provider of text analysis software for information governance and eDiscovery. Currently they have generated increased interest based on the reported potential acquisition of the company by Microsoft. Provided in this post is a quick reference listing of select technology, governmental and commercial entities that are currently represented as part of Equivio’s installed base or have been mentioned between 2005 and today on the Equivio website in the form of a press release.


Best Practices: Defensibly Collecting, Reviewing, and Producing Email

Best Practices: Defensibly Collecting, Reviewing, and Producing Email

Emails and their attachments represent an increasingly significant portion of ESI (Electronically Stored Information) collections and for good reason, too. The hundreds of billions of emails that are sent daily paint a comprehensive picture of our personal and professional lives. It’s no wonder that litigators must thoroughly and effectively review these collections for key evidence. Most often, the “smoking guns” are hiding in Outlook .msg files and their attachments. But the peculiarities of email format can make this key evidence difficult to find, defensibly collect, process, review, search, and produce without proper knowledge and tools.


What If Companies Don’t Own All That Data They’re Collecting?

What If Companies Don’t Own All That Data They’re Collecting?

Big data and the “internet of things” — in which everyday objects can send and receive data — promise revolutionary change to management and society. But their success rests on an assumption: that all the data being generated by internet companies and devices scattered across the planet belongs to the organizations collecting it. What if it doesn’t?


Preserving Gmail for Dummies

Preserving Gmail for Dummies

Try as I might to make it foolproof, downloading Gmail using IMAP and Outlook is tricky. Happily since my post, the geniuses at Google introduced a truly simple, no-cost way to collect Gmail and other Google content for preservation and portability.


Top 10 Things You Need to Know About Email

Top 10 Things You Need to Know About Email

Emails and their attachments represent an increasingly significant portion of ESI (Electronically Stored Information) collections and for good reason, too. The hundreds of billions of emails that are sent daily paint a comprehensive picture of our personal and professional lives, so it is no wonder that litigators must thoroughly and effectively review these collections for relevant case material. All too often, the “smoking gun” is hiding in .msg files and their attachments, but the peculiarities of email format can make this key evidence difficult to find, process for review, search, and produce.


LIVE Launch of the Big Data Public-Private Partnership

LIVE Launch of the Big Data Public-Private Partnership

Joint press briefing by Neelie KROES, Vice-President of the EC in charge of Digital Agenda, and Jan SUNDELIN, CEO of TIE Kinetix.   See the original brief posting at: LIVE Launch of the Big Data Public-Private Partnership Key Facts to Share@EU_Commission LIVE Launch of the #BigData Public-Private Partnership Tweet Buffer


Microsoft Said to Sign Letter to Acquire Israel’s Equivio

Microsoft Said to Sign Letter to Acquire Israel’s Equivio

Microsoft Corp. (MSFT) has signed a letter of intent to acquire Israeli text-analysis startup Equivio, said a person familiar with the company’s plans, as the software maker bulks up products for analyzing data.


Best Practices: NearDup Identification

Best Practices: NearDup Identification

Near duplicate identification, or ‘NearDup’, is a critically important eDiscovery function that can drastically increase the speed and quality of your review by grouping similar documents, maintaining email threads, retrieving unmarked ‘hot’ documents, and preventing the inadvertent release of critical privileged information. As document collections continue to grow, so does the risk of missing key documents, inconsistently coding productions, and releasing privileged information.


Best Practices: Litigation Timelining

Best Practices: Litigation Timelining

As the volume of discoverable data continues to increase, creating functional fact and issue timelines is more important than ever. During the early stages of litigation, timelines can help you develop eDiscovery strategies, identify collection sources, predict disputed facts and issues, and create a preliminary chronological relationship between case events. And as your case progresses, timelines can assist you when preparing for depositions, carrying out motion practice, and conducting trial, as well. Associating key documents with timeline events allows you to track the truly critical data in the increasingly murky sea of irrelevant emails and documents subject to discovery collection. Most importantly, a well managed fact timeline enables you to present the best case possible.


Best Practices: Litigation Document Management

Best Practices: Litigation Document Management

Increasing volumes of electronically stored information (ESI) in litigation have created the need for highly specialized document management systems to organize and manage discovery review. Though litigation databases have existed for many years, there are more options than ever for legal professionals to choose from that can deliver faster and more cost effective document reviews. All this while still including the most advanced eDiscovery and case management functionality available.


High-Speed eDiscovery Processing and Production: A New Approach

High-Speed eDiscovery Processing and Production: A New Approach

The Lexbe eDiscovery Processing System (LEPS) converted the entire 53 GBs of the EDRM Enron Dataset, with 5 million page equivalents, into industry-standard TIFF images in only 5.3 hours. To accomplish this task in this short time, LEPS programmatically deployed and utilized over 60 parallel server instances, and maintained a sustained throughput rate of 240 GBs/day (23 Million pages/day) for Native to TIFF processing.


Black Swans and Information Governance

Black Swans and Information Governance

In the absence of a black swan recently happening to you and your organization, how can you convince the powers that be that they should take some preventive and/or precautionary course of action to stave off a subsequent disaster? These questions have direct relevance to the matter of “selling” information governance to the C-suite in our increasingly Big Data world.


Visual Classification for Information Governance: A Concise Explanation

Visual Classification for Information Governance: A Concise Explanation

BR’s visual classification technology enables data management, analysis and governance tasks. BR technology automates the collection, reduction, classification and governance of large volumes of data. It is unique in the fact that it supports data in any file structure, format or type within one platform.


New Survey Shows eDiscovery Workload, Predictive Coding Use Increasing

New Survey Shows eDiscovery Workload, Predictive Coding Use Increasing

eDiscovery workload, the use of predictive coding and projected rate of adoption of technically assisted review are all up significantly, according to a new report by recruiting and staffing firm The Cowen Group.