This document describes an algorithm aimed at recognizing Named Entities in Polish text, which is powered by two knowledge sources: the Polish Wikipedia and the Cyc ontology. Besides providing the rough types for the recognized entities, the algorithm links them to the Wikipedia pages and assigns precise semantic types taken from Cyc. The algorithm is verified against manually identified Named Entities in the one- million sub-corpus of the National Corpus of Polish
number of pulisher's sheets:
0,5
conference:
Federated Conference on Computer Science and Information Systems (FedCSIS) 2013; 2013-09-08; 2013-09-11; Kraków; Polska; ; ; ;
affiliation:
Wydział Zarządzania i Komunikacji Społecznej : Katedra Lingwistyki Komputerowej