Next: 2.3 Milestonesdeliverables and
Up: 2.2 The Workplan
Previous: Work Part #integration>:
Work Part 5 - ``Evaluation'' is organized in the following
way.

A short description of WP4 as a whole follows.
- WP5 - Objectives
- The objective of Work Part 5 is
the investigation of the efficiency and effectiveness of the theories and
of the related prototypical implementations developed within Work Parts
1 to 4. This is done both from a theoretical
viewpoint, i.e. by studying the properties of the proposed theories in
terms of computational complexity, and from an experimental
viewpoint, i.e. by first devising and then running experiments that are
adequate for understanding the effectiveness of information retrieval
systems based on this logic.
- WP5 - Approach
- The computational complexity of the proposed
theories will be studied by first relying on standard analysis techniques,
and then, if negative results were obtained from these, by relying on
techniques dealing with ``probabilistic'' computational complexity. In
order to study the retrieval effectiveness of the prototypes resulting from
Work Parts 1 to 4, instead, retrieval experiments
will be performed and the results will be compared with those of similar
approaches.
- WP5 - Expected results
- The computational properties of the
theories resulting from Work Parts 1 to 3 and of
the MIRLOG logic will have been characterized. Retrieval results will show
the overall retrieval quality that can be achieved based on these theories
and on the MIRLOG logic. In addition, these results will show strengths and
weaknesses of our approach, thus indicating possibilities for further
improvements.
Work Part 4 is further structured into Tasks T41 to T45. We now
give a concise description of the objectives, approaches taken, and results
expected from each of these tasks.
- T51 - Objectives
- The objective of T51 ``Computational
studies'' is to assess the characteristics of the theories resulting from
Work Parts 1 to 4 in terms of computational
complexity.
- T51 - Approach
- The approach that will be followed in T51 will be
to use the standard analysis techniques of computational complexity. In
case that the results of this evaluation are negative (i.e. in case these
theories turn out to have bad computational properties), techniques from
probabilistic computational complexity will be used in order to try to
assess whether these theories admit instead a tractable algorithm whose
answers are correct only with probability close to 1. In such a case,
given the imprecision already inherently present in the information
retrieval endeavour, the imprecision introduced by a probabilistic
algorithm would be negligible, and the problem would be deemed tractable.
- T51 - Expected results
- The result expected from T51 is a
characterization of the computational properties of the the theories
resulting from Work Parts 1 to 4, in
deterministic terms and (if these were to be negative) in probabilistic
terms. Results of this task will obviously influence the course of action
that will be taken in the accomplishment of Work Part 4..
- T52 - Objectives
- In T52 ``Design of experiments'', retrieval
experiments for evaluating the effectiveness of the theories resulting from
Work Parts 1 to 4 are designed. This includes the
experimental design as well as the setup of a prototypical system for
performing the retrieval experiments.
- T52 - Approach
- We will use the routing queries from the TREC
collection for setting up retrieval experiments. This means that besides
the query formulation itself, a number of documents with relevance
judgements are available for each query. Based on this data, query-specific
knowledge bases can be set up. Other research groups with similar (advanced)
representation techniques have taken the same approach. In addition, there
are retrieval figures for a number of other approaches to which we can
compare our final results. For the experiments, it has to be clearly
defined which knowledge sources are used and what kinds of manual
intervention is allowed. The prototypical system will be implemented by
coupling the prototypes developed by the different groups with a search
engine like e.g. SMART.
- T52 - Expected results
- The result of this task will be a precise
specification of the experiments to be performed, and a prototypical IR
system.
- T53 - Objectives
- In Task T53 ``Data acquisitions'', the data
required for performing the retrieval experiments has to be collected.
- T53 - Approach
- Since the UNIDO-CS group is participating in the
TREC initiative, the queries, documents and relevance judgements can be
provided by this group. In addition, there are retrieval figures for a
number of other approaches to which we can compare our results. Based on
this material, the queries will be transformed manually into queries in
terms of the specific theories to be evaluated. Furthermore, query-specific
knowledge bases will be set up semi-automatically by considering the
feedback documents.
- T53 - Expected results
- As a result of this task, a collection of
experimental data is provided which also can be used by other research
groups for performing similar experiments.
- T54 - Objectives
- In Task T54 ``Experiments'', the retrieval
runs are performed in order to assess the effectiveness of retrieval based
on the various theories to be evaluated.
- T44 - Approach
- By using the prototypical system set up in T42 and
the data from T43, first retrieval runs are performed. Then effectiveness
figures are computed by using standard evaluation measures.
- T44 - Expected results
- Retrieval output lists, overall and
query-specific evaluation figures will be the results of this task.
- T55 - Objectives
- In the final task ``Analysis of the
experimental results'', the results from T54 are analysed.
- T55 - Approach
- The overall figures show the overall quality of
the various approaches tested, whereas query-specific comparisons with other
approaches indicate specific strengths and weaknesses; furthermore, these
results may point towards possible improvements.
- T55 - Expected results
- Besides figures for the overall quality of
MIRLOG-based retrieval, we expect findings about its strengths and
weaknesses and indications for further improvements.
The participating (P) consortium members for each of the Tasks in
Work Part 4 are listed in the following table.

Next: 2.3 Milestonesdeliverables and
Up: 2.2 The Workplan
Previous: Work Part #integration>: