There’s never been a phenomenon like Docker . Eighteen months ago, the company took its core technology, which enables IT people to move software easily between different machines by enclosing it in “containers”, and made it open source.
Rolling intelligence is the enterprise-level equivalent of pay it forward. Effort invested in one business unit, functional area, or type of document storage to visually classify documents benefits the other units, areas, or types of storage that are processed later on. It is the gift that keeps on giving. The reason is that there is a heavy overlap on the types of documents that are used or stored in different areas of an enterprise.
As the size of document collections continue to explode, finding the evidence needle in the ESI haystack is more challenging than ever. Understanding the latest tools, indexing techniques, and features associated with advanced eDiscovery keyword search will allow you to conduct a highly effective review while still delivering your production on time.
“Free eDiscovery Processing” sounds too good to be true. Until now, you may have spent many hundreds of dollars per GB to process native documents like Outlook Email and Microsoft Office files into paginated reviewable formats like TIFF or PDF. So how can those charges simply disappear? The revolutionary answer lies with Lexbe’s secure, scalable cloud technology.
Perhaps the most important conclusion of the study was that an advanced TAR 2.0 protocol, continuous active learning (CAL), proved to be far more effective than the two standard TAR 1.0 protocols used by most of the early products on the market today—simple passive learning (SPL) and simple active learning (SAL).
In a recent blog post on cross-border e-discovery, Sasha L. Hefler and Chris Dale discuss the differences between discovery in the United States and abroad, and the resulting challenges. As they point out, discovery in the United States is much more broad than other common law countries, which put limits on discovery in terms of scope—requiring proportionality—and use of personally identifiable information and other private data. While such differences in approach pose challenges in terms of cross-border discovery, these differences may also hold lessons to be learned for those looking to achieve more reasonable and proportionate discovery here in the United States.
Whether producing in Native, TIFF, PDF, or a blended format, discovery productions are fraught with potential challenges and obstacles that can foil your ability to meet deadlines and satisfy discovery requirements. Here we present 10 of the most common mistakes litigation professionals can encounter, as well as best practices to avoid them.
By Bernard Marr We surely see a lot of hype surrounding big data but I believe the following 25 facts speak for themselves and help to paint a realistic picture of the phenomenon we now call ‘Big Data’ – a phenomenon that is changing the world as we know it. Every 2 days we create […]
Storage of Big Data–and particularly unstructured data such as video, PPT presentations, and Word documents that do not fit into a database–has been the subject of considerable discussion. While proper, secure storage was a natural evolution in the Big Data debate lifecycle, it is now merely a precondition to much higher orders of analysis and business intelligence. In this respect, the Big Data remains too small in many minds.
91% of firms engaged in litigation continue to use TIFFs as a favored review format. On the one hand, TIFFs are a standardized, reviewable format that litigation professionals are comfortable with. On the other, TIFF processing requires tremendous computing resources and, as a result, has traditionally been slow and expensive. As the size of collections continue to explode, there is additional pressure to process more native documents into TIFF – and fast.
Founded in 2004, Equivio has become one of the world’s leading provider of text analysis software for information governance and eDiscovery. Currently they have generated increased interest based on the reported potential acquisition of the company by Microsoft. Provided in this post is a quick reference listing of select technology, governmental and commercial entities that are currently represented as part of Equivio’s installed base or have been mentioned between 2005 and today on the Equivio website in the form of a press release.
Emails and their attachments represent an increasingly significant portion of ESI (Electronically Stored Information) collections and for good reason, too. The hundreds of billions of emails that are sent daily paint a comprehensive picture of our personal and professional lives. It’s no wonder that litigators must thoroughly and effectively review these collections for key evidence. Most often, the “smoking guns” are hiding in Outlook .msg files and their attachments. But the peculiarities of email format can make this key evidence difficult to find, defensibly collect, process, review, search, and produce without proper knowledge and tools.
Big data and the “internet of things” — in which everyday objects can send and receive data — promise revolutionary change to management and society. But their success rests on an assumption: that all the data being generated by internet companies and devices scattered across the planet belongs to the organizations collecting it. What if it doesn’t?
Try as I might to make it foolproof, downloading Gmail using IMAP and Outlook is tricky. Happily since my post, the geniuses at Google introduced a truly simple, no-cost way to collect Gmail and other Google content for preservation and portability.
Emails and their attachments represent an increasingly significant portion of ESI (Electronically Stored Information) collections and for good reason, too. The hundreds of billions of emails that are sent daily paint a comprehensive picture of our personal and professional lives, so it is no wonder that litigators must thoroughly and effectively review these collections for relevant case material. All too often, the “smoking gun” is hiding in .msg files and their attachments, but the peculiarities of email format can make this key evidence difficult to find, process for review, search, and produce.
Joint press briefing by Neelie KROES, Vice-President of the EC in charge of Digital Agenda, and Jan SUNDELIN, CEO of TIE Kinetix. See the original brief posting at: LIVE Launch of the Big Data Public-Private Partnership Key Facts to Share@EU_Commission LIVE Launch of the #BigData Public-Private Partnership Tweet Buffer
Microsoft Corp. (MSFT) has signed a letter of intent to acquire Israeli text-analysis startup Equivio, said a person familiar with the company’s plans, as the software maker bulks up products for analyzing data.
Near duplicate identification, or ‘NearDup’, is a critically important eDiscovery function that can drastically increase the speed and quality of your review by grouping similar documents, maintaining email threads, retrieving unmarked ‘hot’ documents, and preventing the inadvertent release of critical privileged information. As document collections continue to grow, so does the risk of missing key documents, inconsistently coding productions, and releasing privileged information.
As the volume of discoverable data continues to increase, creating functional fact and issue timelines is more important than ever. During the early stages of litigation, timelines can help you develop eDiscovery strategies, identify collection sources, predict disputed facts and issues, and create a preliminary chronological relationship between case events. And as your case progresses, timelines can assist you when preparing for depositions, carrying out motion practice, and conducting trial, as well. Associating key documents with timeline events allows you to track the truly critical data in the increasingly murky sea of irrelevant emails and documents subject to discovery collection. Most importantly, a well managed fact timeline enables you to present the best case possible.
Increasing volumes of electronically stored information (ESI) in litigation have created the need for highly specialized document management systems to organize and manage discovery review. Though litigation databases have existed for many years, there are more options than ever for legal professionals to choose from that can deliver faster and more cost effective document reviews. All this while still including the most advanced eDiscovery and case management functionality available.
The Lexbe eDiscovery Processing System (LEPS) converted the entire 53 GBs of the EDRM Enron Dataset, with 5 million page equivalents, into industry-standard TIFF images in only 5.3 hours. To accomplish this task in this short time, LEPS programmatically deployed and utilized over 60 parallel server instances, and maintained a sustained throughput rate of 240 GBs/day (23 Million pages/day) for Native to TIFF processing.
In the absence of a black swan recently happening to you and your organization, how can you convince the powers that be that they should take some preventive and/or precautionary course of action to stave off a subsequent disaster? These questions have direct relevance to the matter of “selling” information governance to the C-suite in our increasingly Big Data world.
BR’s visual classification technology enables data management, analysis and governance tasks. BR technology automates the collection, reduction, classification and governance of large volumes of data. It is unique in the fact that it supports data in any file structure, format or type within one platform.
eDiscovery workload, the use of predictive coding and projected rate of adoption of technically assisted review are all up significantly, according to a new report by recruiting and staffing firm The Cowen Group.