Appendix A – Some basic IR definitions
basic metrics used to quantify success in an IR task are Recall and
precision. They are given here as they are used in several references
without explanation, and are defined:
evaluating Ontology coverage are presented in Extracting
Ontologies from Software Documentation: a Semi-Automatic Method and
its Evaluation, for evaluation of
the success of a semi-automatically created Ontology in contrast to a
manually created one (the gold standard).
lexical overlap (LO) equals to the ratio of the number of
concepts shared by both Ontologies and the number of concepts we wish
LO1 is the set of all the concepts extracted by the tested method and
LO2 the set of concepts of the Gold Standard.
Ontology improvement (OI) equals the ratio of new concepts
extracted by the tested nethod (expressed as the set difference
between extracted and desired pairs) and all pairs of the gold
Salton Index is an important measure of co-occurrence which is
not biased by naturally high occurrence of certain keywords. It is
- The number of co-occurrences of x and y.
Cx - is the number of
occurrences of x.
Cy - is the number of occurrences
Back to the Document