Fri. Apr 19th, 2024
You are viewing ARCHIVED CONTENT released online between 1 April 2010 and 24 August 2018 or content that has been selectively archived and is no longer active. Content in this archive is NOT UPDATED, and links may not function.
 




Extract from article by Herbert L. Roitblat, Ph.D.

Even though many predictive coding tools yield respectable results, they do differ. One difference is their sensitivity to class noise (inconsistency) in the training set. As you might expect, the greater the inconsistency in the coding of the training documents, the poorer the performance of the machine learning system. This inconsistency has rarely been examined in eDiscovery, but we do have enough information to say that the greater the number of people categorizing the training documents, the higher the expected level of inconsistency in their judgments (i.e., the higher the noise). Truth discovery methods could be used to reduce the class noise, but these methods can become expensive.
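To make the trade-off concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not the method used in the article: each simulated reviewer miscodes a document independently with the same fixed probability, and a simple majority vote stands in for a truth discovery method. The function names, the 20% error rate, and the 30% prevalence of responsive documents are all hypothetical values chosen for the example.

```python
import random


def noisy_labels(true_labels, error_rate, rng):
    """Simulate one reviewer: flip each true label with probability error_rate."""
    return [(not y) if rng.random() < error_rate else y for y in true_labels]


def majority_vote(label_sets):
    """Combine several reviewers' codings, document by document, by simple majority."""
    return [sum(votes) * 2 > len(votes) for votes in zip(*label_sets)]


def noise_rate(labels, true_labels):
    """Fraction of documents whose assigned label disagrees with the true label."""
    return sum(a != b for a, b in zip(labels, true_labels)) / len(true_labels)


rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical collection: 10,000 documents, about 30% responsive (True).
truth = [rng.random() < 0.3 for _ in range(10_000)]

# One reviewer codes every document once (assumed 20% error rate).
single = noisy_labels(truth, 0.2, rng)

# Three reviewers each code every document; their votes are then combined.
panel = [noisy_labels(truth, 0.2, rng) for _ in range(3)]
voted = majority_vote(panel)

single_noise = noise_rate(single, truth)
voted_noise = noise_rate(voted, truth)
review_cost = len(panel) * len(truth)  # document reviews needed for the vote

print(f"class noise, single reviewer: {single_noise:.3f}")
print(f"class noise, 3-way majority:  {voted_noise:.3f}")
print(f"document reviews required:    {len(truth)} vs {review_cost}")
```

Under these assumptions the voted labels contain noticeably less class noise than a single reviewer's, but every document has to be reviewed three times, which is where the expense comes from. Real reviewers are unlikely to err independently or at equal rates, so the gain in practice may be smaller.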

Not every eDiscovery case merits detailed examination of the accuracy of the search process. Knowledge of the variables that affect that accuracy, however, can help in selecting the right tools and methods. To the variables described in the introduction to this article, we should add (5) class noise in the training set.

 



 
