Daily we read, see and hear more and more about the challenges associated with organizational and individual productivity in the world of social media. This week’s cartoon and clip highlights one proven technique for increasing social media productivity (cartoon) and several cool social media tools that may be useful for increasing social media productivity (clip).
Rolling intelligence is the enterprise-level equivalent of pay it forward. Effort invested in one business unit, functional area, or type of document storage to visually classify documents benefits the other units, areas, or types of storage that are processed later on. It is the gift that keeps on giving. The reason is that there is a heavy overlap on the types of documents that are used or stored in different areas of an enterprise.
Daily we read, see and hear more and more about the challenges and concerns associated with predictive coding. This week’s cartoon and clip highlights a visual depiction of two knowledge workers taking a random sampling approach to predictive coding (cartoon) and some considerations for thinking about the challenges associated with textual analytics-based technology-assisted review platforms. (clip).
Published on November 10, 2014, the new Gartner Magic Quadrant for Enterprise Information Archiving (G00262936) provides information technology and business professionals with information and insight into solutions available to meet compliance and eDiscovery challenges while reducing primary storage costs.
Daily we read, see and hear more and more about the organizational risks associated with social media use. This week’s cartoon and clip highlights a unique approach to dealing with social media risk (cartoon) and some considerations for thinking about and evaluating organizational risk and cost related to social media (clip).
In a sea of 600+ e-discovery providers in the US alone, trying to find the right vendor to meet and fulfill your requirements is difficult. Like purchasing a car, you have a choice of vendors that range from local, to regional and national providers. Some that use their own technologies, others that use off-the-shelf products and a few others that provide traditional processing and hosting services spawned from the paper world.
Perhaps the most important conclusion of the study was that an advanced TAR 2.0 protocol, continuous active learning (CAL), proved to be far more effective than the two standard TAR 1.0 protocols used by most of the early products on the market today—simple passive learning (SPL) and simple active learning (SAL).
In a recent blog post on cross-border e-discovery, Sasha L. Hefler and Chris Dale discuss the differences between discovery in the United States and abroad, and the resulting challenges. As they point out, discovery in the United States is much more broad than other common law countries, which put limits on discovery in terms of scope—requiring proportionality—and use of personally identifiable information and other private data. While such differences in approach pose challenges in terms of cross-border discovery, these differences may also hold lessons to be learned for those looking to achieve more reasonable and proportionate discovery here in the United States.
By Bernard Marr We surely see a lot of hype surrounding big data but I believe the following 25 facts speak for themselves and help to paint a realistic picture of the phenomenon we now call ‘Big Data’ – a phenomenon that is changing the world as we know it. Every 2 days we create […]
Daily we read, see and hear more and more about the challenges of managing unstructured data stores in information governance and eDiscovery. This week’s cartoon and clip highlights a unique records management challenge faced by many corporations today (cartoon) and a research firm’s aggregated listing of 28 file analysis software vendors than can contribute to increased efficiency in information management decisions for unstructured data (clip).
Storage of Big Data–and particularly unstructured data such as video, PPT presentations, and Word documents that do not fit into a database–has been the subject of considerable discussion. While proper, secure storage was a natural evolution in the Big Data debate lifecycle, it is now merely a precondition to much higher orders of analysis and business intelligence. In this respect, the Big Data remains too small in many minds.
An overview of Technology-Assisted Review and Predictive Coding related case law, reports and articles as compiled by Robert Brownstone of Fenwick & West.
“So what’s the big deal?” I asked Mark Noel, one of our senior Predict consultants (and much smarter than me about this stuff). “Moving from one document in 100 to seven doesn’t seem like much of an improvement,” I added. “Why couldn’t we get these numbers up to 35% or, heck, even higher to 60% or more?”
Daily we read, see and hear more and more about the cost, time and complexity factors that impact information governance and eDiscovery sourcing decisions. This week’s cartoon and clip features a contemplative look at post-decision stress syndrome (cartoon) and a short reminder of some of the key elements to consider when making important sourcing decisions (clip).
Founded in 2004, Equivio has become one of the world’s leading provider of text analysis software for information governance and eDiscovery. Currently they have generated increased interest based on the reported potential acquisition of the company by Microsoft. Provided in this post is a quick reference listing of select technology, governmental and commercial entities that are currently represented as part of Equivio’s installed base or have been mentioned between 2005 and today on the Equivio website in the form of a press release.
Big data and the “internet of things” — in which everyday objects can send and receive data — promise revolutionary change to management and society. But their success rests on an assumption: that all the data being generated by internet companies and devices scattered across the planet belongs to the organizations collecting it. What if it doesn’t?
In my previous post, I found that relevance and uncertainty selection needed similar numbers of document relevance assessments to achieve a given level of recall. I summarized this by saying the two methods had similar cost. The number of documents assessed, however, is only a very approximate measure of the cost of a review process, and richer cost models might lead to a different conclusion.
One distinction that is sometimes made is between the cost of training a document, and the cost of reviewing it. It is often assumed that training is performed by a subject-matter expert, whereas review is done by more junior reviewers. The subject-matter expert costs more than the junior reviewers—let’s say, five times as much. Therefore, assessing a document for relevance during training will cost more than doing so during review.
Daily we read, see and hear more and more about technology developments that impact the areas of information governance and electronic discovery. This week’s cartoon and clip features a unique look at innovative thinking in these critical areas (cartoon) and quick reference links to six interesting blogs that regularly highlight the need to truly think about these critical areas (clip).
A critical metric in Technology Assisted Review (TAR) is recall, which is the percentage of relevant documents actually found from the collection. One of the most compelling reasons for using TAR is the promise that a review team can achieve a desired level of recall (say 75% of the relevant documents) after reviewing only a small portion of the total document population (say 5%). The savings come from not having to review the remaining 95% of the documents.
Try as I might to make it foolproof, downloading Gmail using IMAP and Outlook is tricky. Happily since my post, the geniuses at Google introduced a truly simple, no-cost way to collect Gmail and other Google content for preservation and portability.
Daily we read, see and hear more and more about the health, legal and business developments in the pharmaceutical industry. This week’s cartoon and clip features a unique look at the retail sales impact of the pharmaceutical industry (cartoon) and a quick reference link to one of the most informational and timely resources on the pharmaceutical industry, FiercePharma (clip).
Published on September 23, 2014, the new Gartner Market Guide for File Analysis Software (G00262949) provides information technology and business professionals with information and insight into more efficient, less costly and less risky ways to manage what is generally regarded as unstructured data through the use of file analysis software.
Daily we read, see and hear more and more about the latest privacy and data security breaches in consumer, corporate and governmental arenas. This week’s cartoon and clip features a unique approach to ensuring personal data security (cartoon) and a quick reference link to one of the most informational and timely resources on privacy and data security, the LXBN Privacy & Data Security Blog Channel (clip).
By K&L Gates Dynamo Holdings Ltd. P’ship v. Comm’r of Internal Revenue, Nos. 2685-11, 8393-12 (T.C. Sept. 17, 2014) In this case, the court approved petitioners’ (Dynamo Holdings Ltd. Partnership et. al.) use of predictive coding to identify potentially responsive and privileged data contained on two backup tapes, despite respondent’s (Commissioner of Internal Revenue) objection […]