Archive for the ‘law’ Category

Meeting with John Sheridan on the Semantic Web and Public Administration

Tuesday, August 11th, 2009

I met today with John Sheridan, Head of e-Services, Office of Public Sector Information, The National Archives, located at the Ministry of Justice, London, UK. Also at the meeting was John’s colleague Clare Allison. John and I had met at the ICAIL conference in Barcelona, where we briefly discussed our interests in applications of Semantic Web technologies to legal informatics in the public sector. Recently, John got back in contact to talk further about how we might develop projects in this area.

Perhaps most striking to me is that John made it clear that the government (at least his sector) is proactive, looking for research and development projects that make government data available and usable in a variety of ways. In addition, he wanted to develop a range of collaborations to better understand the opportunities the Semantic Web may offer.

As part of catching up with what is going on, I took a look around the web for relatively recent documents on related activities.

In our discussion, John gave me an overview of the current state of affairs in public access to legislation, in particular, the legislative markup and API. The markup is intended to support publication, revision, and maintenance of legislation, among other possibilities. We also had some discussion about developing an ontology of goverment which would be linked to legislation.

Another interesting dimension is that John’s office is one of a few that I know of which are actively engaged to develop a knowledge economy partly encouraged by public administrative requirements and goals. Others in this area are the Dutch and the US (with xml.gov). All very promising and discussions well worth following up on.

Copyright © 2009 Adam Wyner

Legal Taxonomy

Saturday, May 16th, 2009

Introduction
In this post, I comment on Sherwin’s recent article Legal Taxonomy in the journal Legal Theory. It is a very lucid, thorough, and well-referenced discussion of the state-of-the-art in taxonomies of legal rules. By considering how legal taxonomies organise legal rules, we better understand current conceptions of legal rules by legal professionals. My take away message from the article is that the analysis of legal rules could benefit from some of the thinking in Linguistics and Computer Science, particularly in terms of how data is gathered and analysed.

Below, I briefly outline ideas concerning taxonomies and legal rules. Then, I present and comment on the points Sherwin brings to the fore.

Taxonomies
Taxonomy is the practice and science of classification of items in a hierarchical IS-A relationship, where the items can be most anything. The IS-A relationship is also understood as subtypes or supertypes. For example, a car is a subtype of vehicle, and a Toyota is a subtype of car; we can infer that a Toyota is a subtype of vehicle. Each subtype has more specific properties than the supertype. In some taxonomies, one item may be a subtype of several supertypes; for example, a car is both a subtype of vehicle and a subtype of objects made of metal, however, not all vehicles are made of metal, nor are all things made from metal vehicles, which indicates that these types are distinct. Taxonomies are more specific than the related term ontologies, for which a range of relationships beyond the IS-A relationship may hold among the items such as is owned by or similar. In addition, ontologies generally introduce properties of elements in the class, e.g. colour, engine type, etc. Classifications in scientific domains such as Biology or Linguistics is intensely debated and revised. It would be expected that this would be even more so true in the legal domain which is comprised of intellectual evidence rather than empirical evidence as in the physical sciences and where the scientific method is not applied.

Legal Rules
First, let us be clear about what a legal rule is with a clear example following Professor David E. Sorkin’s example . A legal rule is a rule which determines whether some proposition holds (say of an individual) contingent on other propositions (the premises). For example, the state of Illinios assault statute specifies: “A person commits an assault when, without lawful authority, he engages in conduct which places another in reasonable apprehension of receiving a battery.” (720 ILCS 5/12-1(a)). We can analyse this into the legal rule:

    A person commits assault if

      1. the person engages in conduct;
      2. the person lacks lawful authority for the conduct;
      3. the conduct places another in apprehension of receiving a battery; and
      4. the other person’s apprehension is reasonable.

Optimally, each of the premises in a rule should be simple and be answerable as true or false. In this example, where all four premises are true, the conclusion, that the person committed assault, is true.

There are significant issues even with such simple examples since each of the premises of a legal rule may itself be subject to further dispute and consideration; the premises may be subjective (e.g. was the conduct intentional), admit degrees of truth (e.g. degree of emotional harm), or application of the rule may be subject to mitigating or aggravating circumstances. The determination of the final claim follows the resolution of these subsidiary disputes and considerations. In addition, some legal rules need not require all of the premises to be true, but allow a degree of counterbalancing evaluation of the terms.

The Sources of Legal Rules
Sherwin outlines the sources of the rules:

      Posited rules, which are legal rules as explicitly given by a legal authority such as a judge giving a legal decision.
      Attributed rules, which are legal rules that are drawn from a legal decision by a legal researcher rather than by a legal authority in a decision. The rule is implicit in the other aspects of the report of the case.
      Ideal rules, which are rules that are ‘ideal’ relative to some criteria of ideality, say morally or economically superior rules.

Purposes of Classification
In addition, we have the purposes or uses of making a classification of legal rules.

      Facilitating the discussion and use of law.
      Supporting the critical evaluation of law
      Influencing legal decision-making

In the first purpose, the rules are sorted into classes, which helps to understand and manage legal information. In Sherwin’s view, this is the most basic, formal, and least ambitious goal, yet it relies on having some taxonomic logic in the first place. The second purpose, the rules are evaluated to determine if they are serving the intended purpose as well as to identify gaps or inconsistencies. As Sherwin points out, the criteria of evaluation must then also be determined; however, this then relates to the criteria which guides the taxonomy in the first place, a topic we touch on below. The final purpose is a normative one, where the classification identifies the normal circumstances under which a rule applies, thereby also clarifying those circumstances in which the rule does not apply. Sherwin points out that legal scholars vary in which purpose they find attractive and worth pursuing.

While I can appreciate that some legal scholars might not find the ‘formal’ classification of interest, I view it from a different perspective. First, any claim concerning the normative application of one rule instead of another rest entirely on the intuitive presumption that the rules are clearly different. This is a distinction that the first level can help to clarify. Similar points can be made for other relationships among rules. Second, focusing on the latter stage does not help to say specifically why one rule means what it does and has the consequences as intended; yet surely this is in virtue of the specific ‘content’ of the rule, which again is clarified by a thorough going analysis at the first stage. Third, if there is going to be any progress in applied artificial intelligence and law, it will require the analytical elements defined at the first stage. Fourth, as the study of Linguistics has shown, close scrutiny at the first stage can help to reveal very issues and problems that are fundamental to all higher stages. Fifth, providing even a small, clear sample of legal arguments analysed along other lines of the first stage can give the community of legal scholars a common ‘pool’ of legal arguments to fruitfully consider at the later stages; along these lines, it is notable how few concrete, detailed examples Sherwin’s paper discusses. Not surprisingly, some of the issues Sherwin raises about the purposes of different ‘levels’ of analysis also appear in the linguistic literature. In my view, though the first stage may not be interesting to most legal professionals, there are very good reasons why it should be.

Criteria of Taxonomy
Several different criteria which guide the taxonomy of legal rules are discussed.

      Intuitive similarity: whether researchers claim that two rules are subtypes of one another.
      Evolutionary history: the legal rule is traced in the history of the law.
      Formal classification: the logical relations among categories of the law.
      Function based: a function from the problem to a set of solutions.
      Reason based: the higher-level reasons that explain or justify a rule.

Sherwin criticises judgements based on intuitive similarity since the taxonomers may be relying on false generalisations rather than their own intuitions and that intuition can be arbitrary and without reason. This is also the sort of criticism leveled at large segment of linguistic research and which has been shown to be misleading. Of course, one must watch false classifications and try to provide a justification for classifying one element in one class and not another. One way to do this is, as in psycholinguistics, is to provide tests run over subjects. Another way is to refine the sorts of observations that lead to classifications. In general, all that we currently know about language, from dictionaries, to grammars, to inference rules is based on linguistic intuitions. Some, such as the rules of propositional logic, have been so fixed that they now seem to exist independent of any linguistic basis.

The issue here is somewhat related to classification by formal logical relations. It is unclear what Sherwin thinks logical relations are and how they are applied. What we do have more clarity on are some of the criteria for such a formal taxonomy: accounting for all legal materials, a strict hierarchy, consistent interpretation of classes, and no overlap of categories. This is but one way to consider a formal hierarchy; indeed, there is a separate and very interesting question about what formal model of classification best suits a legal taxonomy. Yet, this issue is not explored in the article.

The function based approach seems to have meta categories. For example, the rule above can be seen as a function from circumstances to a classification of a person as having committed an assault. However, this is not what appears to be intended in Sherwin’s discussion. Rather, there are meta-functional categories depending on higher level problems and solutions. The examples given are Law as a Grievance-Remedial Instrument and Law as an Administrative-Regulatory Instrument. For me, this is not quite as clear as Sherwin makes it appear.

The reason approach organises rules according to an even higher-level of the rule — the justification or explanation of the rule. Some of the examples are that a wrongful harm imposes an obligation for redress, deterring breaches of promises facilitate exchange, or promoting public safety. In my view, these are what people (e.g. Professor Bench-Capon) in AI and Law would call values which are promoted by the legal rule. Sherwin discusses several different ways that reason based classification is done: intended, attributed, and ideal rationales. In my view, the claimed differences are not all that clear or crucial to the classification. In some cases, the rationale of a legal rule is given by the adjudicator. However, where this is not so, the rationale is implicit and must be interpreted, which is to give the intended rationale. In other cases, legal researchers examine a body of law and provide rationales, which is the attributed rationale. In this sense, the intended and attributed rationales are related (both interpreted), but achieved by different methods (study of one case versus study of a body of cases and considerations about the overall purpose of the law). Finally, there are ideal rationales, which set out broad, ideal goals of the legal rule, which may or may not be ‘ideally’ achievable. In this, the difference between intended/attributed and ideal is whether the rationale is analysed out of cases (bottom-up) or provided legislatively (top-down). In the end, the result is similar — legal rules are classified with respect to some rationale. The general problem with any such rationale is just how it is systematically given and itself justified so as to be consistent and not to yield conflicting interpretations of the same legal rule. Finally, Sherwin seems to think that there is some intrinsic conflict or tension between formal classification and reason based classification. I don’t agree. Rather, the difference is in the properties and methods being employed to make the classification, which are not inherently in conflict. Likely, a mixed approach will yield the most insights.

Copyright © 2009 Adam Wyner

Susskind’s “The End of Lawyers” is Part of the Story

Thursday, April 16th, 2009

Introduction

In this post, I briefly outline Richard Sussking’s background, elements from The End of Lawyers, and then turn to consider issues that Susskind is aware of but does not discuss in depth. These are issues which I believe are fundamental to how technology will impact legal practice such as the semantic web, textual information extraction, ontologies, and open source databases of legal documents.

Background

Susskind specialises in how information and communication technology (ICT) is used by lawyers and public administrators. His website is:

www.susskind.com

Besides the important and general interest of his line of work, its prominence in the community of practicing legal professionals gives us a good indication of the sorts of technologies that community is and is not aware of.

Richard Susskind has been writing about ICT since publication of his PhD thesis Expert Systems in Law (1987, Oxford University Press). He is among the early researchers in Artificial Intelligence and the Law. His subsequent books — The Future of Law and Transforming the Law — developed themes about the relation of ICT and the legal profession, focusing on the ways ICT would change the practice of law and the interactions among lawyers, government administrators, and the public. In addition to the books, Susskind consults widely, is an editor of the journal International Journal of Law and Information Technology, and is a law columnist for The Times. He is very uniquely informed about the technologies that are available and how the legal community regards and uses them. This makes it all the more interesting to draw attention to what he does not discuss in depth.

His recent book The End of Lawyers has garnered a very significant amount of attention, and online excerpts along with comments can be found at:

The End of Lawyers

Legal Technology Tools

In this book, he develops and elaborates his main themes. He points out a range of technologies, briefly outlined below, which will contribute to changing the legal profession. As there is substantial information already on line about his proposals, I will not here repeat them in depth, but to say that by and large I agree with many of the overt points he makes about the applicability of technology to the legal profession as well as why the legal profession has been and remains slow to take up ICT solutions.

Among the key technologies Susskind outlines, we find:

      Automated document assembly — structuring blocks of legal documents.
      Connectivity — email, fax, cell phones, facebook, twitter, blogs.
      Electronic legal marketplace — legal services advertised, rated, and traded.
      E-learning — lawyers and members of the public having the opportunity to learn about the law online.
      Online legal guidance — rather than face-to-face with individual lawyers, a chance to read, learn about the law, have questions addressed at different levels of formality.
      Legal open-sourcing — user generated content, free and unrestricted legal information (e.g. BAILII), legal wikis.
      Closed legal communities — collectives of lawyers, justices, or government officials exchange information.
      Workflow and project management — using software and services to monitor and support the work of legal professionals. This includes case-management and electronic filing.
      Embedded legal knowledge — legal information and knowledge is more readily transparent in daily interactions or prevents non-compliance.
      E-disclosure — finding and processing documents and information relevant to the disclosure phase of a case.
      Online dispute resolution — systems to mediate and support the resolution of disputes.
      Courtroom annotation — transcribing and noting courtroom proceedings manually and automatically.
      Improving access to law — giving citizens more information and advice.

Engineering and Managing Legal Knowledge

In the course of the book, he says that the engineering and management of legal knowledge is central to these technologies, where:

      Legal knowledge management (p. 155) — the systematic organization, standardization, preservation, and exploitation of the collective knowledge of a firm. It is intended to maximize the firm’s return on the combined experience of its lawyers over time.
      Legal knowledge engineer (p. 272): someone who carries out basic analysis, decomposition, standardization, and representation of legal knowledge in computer systems.

However, little is said about how the engineering and management is to be done other than that some of the technologies outlined above contribute to them.

What is said is largely by way of brief references or outlines to additional issues such as the semantic web (p. 68), wikis (but not semantic wikis), online dispute resolution (but little on current developments), and open source legal information (e.g. BAILII, but not WorldLii).

More to the point, there is no discussion of research on key technologies such as:

      Legal ontologies by which legal knowledge is formalised, acquired, processed, and managed.
      XML which underlies the semantic web
      Web-based inference systems
      Textual information extraction which is essential to make use of open source legal information
      Rule-based systems such as provided by Oracle (previously known as Softlaw, RuleBurst, and Haley) which are prominently used by UK tax authorities
      E-government services which go beyond providing information and submission of forms but also allow some interaction such as Parmenides and DEMO-net

These are all topics of central relevance to our blog and to the AI and Law community which organises around the International Conference on AI and Law or Jurix

We agree by and large with Susskind. However, there is much more which would be highly relevant and valuable to draw to the attention of the legal community. Moreover, it would be very valuable to the AI and Law community were his prominent and respected voice in the legal and governmental circles to be heard advocating further for research such as in AI and Law.

Copyright © 2009 Adam Wyner

Legal Track of the Text Retrieval Conference Series

Saturday, March 14th, 2009

The Text Retrieval Conference (TREC) is an annual workshop on text retrieval from large text collections. It is sponsored by the National Institute of Standards and Technology, which is an agency of the US Commerce Department and started in 1992. In The goal of the legal track is to develop search technology that meets the needs of lawyers to engage in effective discovery in digital document collections. In 2006, a legal track was added to the conference, and there have been annual tracks the last three years.

The stated goal of the legal track is to develop text search technology to help lawyers discover information in digital document corpora. Papers from the track are published as part of the proceedings of TREC.

In the legal track, researchers are set a variety of tasks and topics among which they can choose to apply their search techniques. Let’s consider one, which was proposed in 2008 and continued in 2009, the Interactive Task, for which we have task guidelines and topics from 2008.

For 2008, the task is to search for documents that relate to topic, which is a single 16 page class action complaint that a tobacco company committed fraud, from among a document population of nearly 7 million documents of the Legacy Tobacco Document Library of legal case documents involving US tobacco companies. There are a wide range of document genres. The task is to realistically model the way that lawyers develop and refine their searches in the course of the discovery phase of litigation; that is, participants must retrieve a set of documents ‘relevant’ to what ought to be discovered concerning the topic. In the discovery phase, the parties to the suit request material (documents and evidence) concerning the case; e-discovery is the discovery phase involving electronic documents. The task is intended to be more ‘realistic’ in that it allows participants to engage an expert so as to better define the set of documents that are relevant to the topic. Here ‘relevant’ means that the participants recover the same set of documents (from the set of documents available) that a lead litigating attorneys would select; thus, the interaction with an expert who helps define relevance. The success of the participants searches are measured in terms of recall, precision, and a ’summary measure of effectiveness’.

Discovery is a key phase of litigation, concerning the identification of information that is important to the litigators in arguing the case. However, we may consider whether it is central to legal argument itself; the evidence discovered is used in arguing the case as evidence for one claim or another, but it is unclear how distinct this is from any sort of argument where evidence is crucial. For example, in a scientific context, one might argue that a certain protein functions in a certain way to impede cancer growth, then search in the document space for supporting evidence. In other words, there are questions concerning how the task in the Legal Track bears on specifically legal reasoning such as case based reasoning, factor analysis, precedent, and grounding decisions in the law. This would be a rather different and very worthwhile task for the TREC Legal Track.

The TREC Legal Track is very closely related to workshops on e-discovery/e-disclosure DESI which are organised by many of the people involved in the TREC Legal Track.

Copyright © 2009 Adam Wyner