42
Evaluation
• Evaluation based on most stringent relevance set (strict
intersection)
• Compared systems using
– MAP across all topics
– Number of topics with no relevant image in the top 100
• 4 participants evaluated (used captions only):
– NTU – Chinese->English, manual and automatic, Okapi and
dictionary-based translation, focus on proper name translation
– Daedalus
– all->English (except Dutch and Chinese), Xapian and
dictionary-based + on-line translation, Wordnet query expansion,
focus on indexing query and ways of combining query terms
– Surrey
– all->English (except Chinese), SoCIS system and on-line
translation, Wordnet expansion, focus on query expansion and
analysis of topics
– Sheffield
– all->English, GLASS (BM25) and Systran translation,
no language-specific processing, focus on translation quality