For over a decade, eDiscovery practitioners and the courts have recognized that machine-learning processes deliver significant improvements over traditional linear document reviews1. Yet emerging applications built on generative AI (Gen AI) technology could radically change the accuracy, speed, and cost of eDiscovery document reviews, leading practitioners to ask: could these tools one day replace, rather than enhance, human lawyer oversight of technology-assisted review workflows?
The Sedona Canada Principles Addressing Electronic Discovery (the Sedona Canada Principles) provide an authoritative framework of best practices for the identification, collection, preservation, review and production of electronically stored information (ESI) in Canada2.
Principle 7 of the Sedona Canada Principles states: “A party may use electronic tools and processes to satisfy its discovery obligations”. The use of technology to improve efficiencies and decrease the costs of document review in eDiscovery has been codified in most provincial rules of civil procedure or practice directions and has been recognized by Canadian courts as falling within the ambit of the “proportionality principle”3. In discovery, the determination of proportionality in the production of documents includes such considerations as (i) whether the time required would be unreasonable, (ii) whether the expense would be unjustified, and (iii) whether the volume of documents required to be produced would be excessive4.
As a practical matter, because most documents that are reviewed as part of the documentary discovery process are now digitally native, and because the volume of ESI for review has increased exponentially, the benefits of technology-assisted workflows in generating efficiencies and cost savings in document review have made the adoption of AI-related technologies a standard part of eDiscovery practices5. While new Gen AI-based workflows purport to advance these benefits by a quantum leap, it is important to understand the underlying foundation to ensure that the trajectory aligns with legal principles and best practices.
Technology-Assisted Review (TAR), also known as “predictive coding”, leverages machine-learning algorithms to identify ESI that is most likely to be relevant or responsive to issues in civil litigation, regulatory reviews or investigations6. TAR is used to classify documents based on relevance, privilege, and responsiveness to the specific issues in the matter. For TAR to work well, the system must be trained by lawyers with deep knowledge of both the case and the relevant eDiscovery software.
In the original TAR (TAR 1.0), a control set of documents chosen by human reviewers is used to train the AI model to identify other documents in the wider data set that would likely be relevant7. The human review team continues training the model until it is considered stable: a determination made through statistical sampling and analysis. Once the TAR model is stable, the classification/ranking algorithm is applied to the entire data set8. Documents with a high relevance score are prioritized for review, leading to significant efficiencies in the time required for eyes-on review.
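The train, test-for-stability, then rank sequence described above can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: the word-weight model, the function names (`train`, `train_until_stable`, `rank`), the use of a held-out control set to measure accuracy, and the 0.01 stability tolerance are all hypothetical and do not reflect how any particular eDiscovery platform works.

```python
import math
from collections import Counter


def train(labeled):
    """Fit per-word log-odds weights from (text, is_relevant) pairs (toy model)."""
    rel, non = Counter(), Counter()
    for text, is_relevant in labeled:
        (rel if is_relevant else non).update(set(text.lower().split()))
    n_rel = sum(1 for _, is_relevant in labeled if is_relevant)
    n_non = len(labeled) - n_rel
    # Add-one smoothing keeps every ratio finite, even for unseen words.
    return {
        word: math.log((rel[word] + 1) / (n_rel + 2))
        - math.log((non[word] + 1) / (n_non + 2))
        for word in set(rel) | set(non)
    }


def score(weights, text):
    """Relevance score: sum of the weights of the distinct words in the text."""
    return sum(weights.get(word, 0.0) for word in set(text.lower().split()))


def control_accuracy(weights, control_set):
    """Share of human-coded control documents the model classifies correctly."""
    correct = sum((score(weights, text) > 0) == label for text, label in control_set)
    return correct / len(control_set)


def train_until_stable(training_rounds, control_set, tolerance=0.01):
    """Add one batch of human-coded documents per round and stop once the
    control-set accuracy changes by no more than `tolerance` (assumed rule)."""
    labeled, weights, previous = [], {}, None
    for batch in training_rounds:
        labeled.extend(batch)
        weights = train(labeled)
        current = control_accuracy(weights, control_set)
        if previous is not None and abs(current - previous) <= tolerance:
            break
        previous = current
    return weights


def rank(weights, documents):
    """Apply the stable model to the full data set, highest score first."""
    return sorted(documents, key=lambda doc: score(weights, doc), reverse=True)
```

In this sketch, the list returned by `rank` plays the role of the prioritized review queue: documents at the top would go to eyes-on review first. A real workflow would typically measure stability with recall and precision estimates drawn from a statistically valid sample rather than a single accuracy figure.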
Continuous Active Learning (CAL), also known as TAR 2.0, refined TAR workflows by eliminating the need for a control set and reducing the amount of training and statistical analysis required. Unlike the two-step training and review process used in TAR 1.0, document reviews that leverage CAL can begin almost immediately, because the CAL algorithm continuously re-ranks documents according to the decisions of the human review team and serves the highest-ranked documents to the reviewers first. In other words, the CAL model “learns” throughout the review process, moving the documents most likely to be relevant to the front of the review queue so that the review team codes them at the outset of the eDiscovery project. As a result, CAL can reduce the time required to identify relevant documents and can reduce the number of lawyers required to complete a review.
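The CAL loop described above (rank, serve the top of the queue, record the reviewers' decisions, retrain, repeat) can be sketched in a few lines. This is a minimal illustration under stated assumptions: the word-weight model, the `cal_review` function, the `reviewer` callback that stands in for a human coding decision, and the batch size are all hypothetical, and the sketch omits the stopping rule a real review would use to end once few relevant documents remain.

```python
import math
from collections import Counter


def train(labeled):
    """Fit per-word log-odds weights from (text, is_relevant) pairs (toy model)."""
    rel, non = Counter(), Counter()
    for text, is_relevant in labeled:
        (rel if is_relevant else non).update(set(text.lower().split()))
    n_rel = sum(1 for _, is_relevant in labeled if is_relevant)
    n_non = len(labeled) - n_rel
    return {
        word: math.log((rel[word] + 1) / (n_rel + 2))
        - math.log((non[word] + 1) / (n_non + 2))
        for word in set(rel) | set(non)
    }


def score(weights, text):
    """Relevance score: sum of the weights of the distinct words in the text."""
    return sum(weights.get(word, 0.0) for word in set(text.lower().split()))


def cal_review(documents, reviewer, batch_size=2):
    """Simulate a CAL review and return (document, decision) in review order.

    `reviewer` is a hypothetical callback standing in for the human coder.
    """
    unreviewed = list(documents)
    labeled = []
    while unreviewed:
        # Retrain on every decision made so far, then re-rank what is left.
        weights = train(labeled)
        unreviewed.sort(key=lambda doc: score(weights, doc), reverse=True)
        batch, unreviewed = unreviewed[:batch_size], unreviewed[batch_size:]
        for doc in batch:
            labeled.append((doc, reviewer(doc)))
    return labeled
```

With no coding decisions available in the first pass, every document scores zero and the queue keeps its original order, which mirrors the point that a CAL review can start without a separate training phase. Each later pass pulls likely-relevant documents forward.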
Although the benefits of technology in document review are well established, some document types, such as those rich in numerical data or images, are not well suited to CAL projects. Further, while TAR reduces the number of reviewers required at first-level review, it does not obviate the need for lawyers to put eyes on documents at second-level review or as part of the privilege review or Quality Control (QC) process9 (for more on how AI tools can be most effectively integrated into legal practice, read “Rules for AI tools: how can legal teams source suitable tech?”).
Unlike earlier iterations of technology-assisted review, emerging Gen AI applications and workflows will require much less initial training or intervention by document reviewers. These new tools fall into three categories10:
As with earlier review technologies, leveraging Gen AI in document review promises greater efficiency, although at present it does not necessarily deliver the same cost savings11. Regardless of which approach is eventually adopted, all Gen AI eDiscovery reviews will continue to require oversight by legal professionals and skilled eDiscovery technologists to ensure accuracy, especially with respect to the protection of legally privileged documents. Ultimately, the predicted time and cost savings ascribed to future Gen AI-powered document reviews do not eliminate the professional duty of counsel to exercise legal judgment and to adopt defensible tools and strategies in the course of documentary discovery12.
Emerging eDiscovery applications that leverage generative AI technology could significantly impact the accuracy, speed, and cost of document reviews. While these applications may reshape existing review workflows and processes, they will not eliminate the need for—or the professional duty of—human lawyers to put eyes on documents.
This article was published as part of the Q4 2024 Torys Quarterly, “Machine capital: mapping AI risk”.