Desirable Features of Rule Based Systems for Medical Knowledge

1 Introduction

The past decade has seen amazing advances in our ability to acquire medical facts. These facts range from new clinical or theoretical discoveries (see [WIHIR]) to our ability to collect and utilize large volumes of patient diagnostic and treatment data. This information has the potential to drastically affect peoples medical treatment and lives in both a positive and negative manner. Proper usage of medical knowledge can literally mean life. Errors of ommission or outright mistakes can mean death or injury.

Our investigations so far have shown that the semantic web is an effective mechanism for representing a wide range of medical knowledge. For example, we have been successful in recasting published clinical guidelines (see [TRIP]) in a usable form using n3 and have been able to use a variety of proof engines to validate clinical contexts against such guidelines.

The sheer volume of largely unstructured, constantly changing facts, procedures and guidelines together with the very human need for assurances of correctness and validation against established procedures make this subject area an ideal area for applications of semantic web technologies.

Our experiments to this point made use of [N3] and rules together with systems such as Euler [EULER], CWM [CWM] and Jena [JENA] to model situations requiring both forward and backward reasoning and take into account some experience with describing the semantics for content MathML [MATHML].

In the course of these investigations we have been able to make several observations concerning the rules, the data, and their interaction.

2 Details

The key items that we wish to take into consideration when reviewing the infrastructure of rules are outlined below.

2.1 Refactoring

We have had a continual need to refactor our rules as we discover new factors that come into play and struggle with finding the right combination of facts and meta-like rules such as transitivity versus collections of expanded rules

2.2 Expressiveness.

There has been a continual tradeoff between expressiveness and the decidability of the resulting queries.

2.3 Rapid Authoring

In the early stages of development (at least until visual editing tools mature) the ability to quickly write down meaningful facts and statements in a human readable form has been essential.

The ability to easily author rules in a human understandable format is essential to developments in this area. History has shown that this was essential to early development of the Web. This only becomes less important after usable editors become widely available. Another case in point is MathML. Given the complexity of writing advanced semantic constructs in MathML the successful authoring of MathML is closely tied to the use of tools.

2.4 Validation.

There is a need for easy methods of validation of both syntax and against a chosen structure. In the long term, this may be addressed by authoring tools, but quick convenient validation is essential to successful authoring.

2.5 Named Formal Definitions.

As we work with different implementations we have developed a library of Rules. There would appear to be a need include rules by name.

It is important to allow for systems that support different logical models. How do we advertise the capabilities of a system? How do we behave when desired functionality is not supported?

We need to at least consider mechanisms for naming individual rules. This can have considerable impact on our ability to quickly author and experiment. Conventions such as name spaces in order to be clear when a named rule is really the same rule is essential.

2.6 Extendability.

As the collection of proof based technologies matures and different balances between expressivness and decidability are explored, the need to plan for extendability becomes important.

We should not have to re-design the language to add new functionality. There are potential lessons to be learned in the design of content MathML and in its appendices providing more formal mathematical definiions.

2.7 Interoperability.

We have needed to consult a variety of proof engines and tools during our investigations. while a diversity of features has been benificial, the notions of discoverability and interoperability emerge, and might even benefit from the notion of a compatibilty layer.

During the course of our investigations we have had the constant need to feed results from one system into another. One of the key outcomes of this activity must be to establish that interoperability.

2.8 Re-Use of Knowledge.

Througout, it has proved helpful to be able to separate the core "re-usable" knowledge from temporal or state specific knowledge and to use meta-structure whereever possible.

The actual data that is produced by a proof must also be valid input to the next proof, or for direct addition to our knowledge pool.

3 Conclusions

We do not presume to have absolute answers to the questions raised here. Nor do we presume that such a list is complete. An effective design will be the result of balancing these needs and most of all through buy-in by the stake-holders. A satisfactory outcome of such a workshop would be some consensus of where to draw the line and an explanations of how the emerging design addresses each of these questions. We look forward to working with participants to move forward in an organized fashion.

Desirable Features of Rule Based Systems for Medical Knowledge

Agfa Note 15 March 2005

Abstract

Status of this Document

Table of Contents

Appendix