https://dictionary.cambridge.org/
http://www.collinsdictionary.com/
http://www.merriam-webster.com/
http://thesaurus.com/
http://www.eurodict.com/
http://www.lingvozone.com/free-online-dictionary
24 March 2018
Picture dictionaries
http://www.oxfordlearnersdictionaries.com/wordlist/english/pictures/pics_A-B/
http://www.kidzwood.com/#a-z
Pictorial dictionary with voice
http://www.opdome.com/
Online picture dictionary
http://www.anglomaniacy.pl/index.html
Vocabulary & Grammar
Songs & Printables
Learning English can be fun
Crosswords and word searches
http://puzzlemaker.discoveryeducation.com/WordSearchSetupForm.asp?campaign=flyout_teachers_puzzle_wordcross
http://worksheets.theteacherscorner.net/make-your-own/word-search/
http://www.teachers-direct.co.uk/resources/wordsearches/
http://tools.atozteacherstuff.com/word-search-maker/wordsearch.php
http://www.puzzle-maker.com/WS/
http://www.armoredpenguin.com/crossword/
http://www.abcya.com/make_a_word_search.htm
Word search example
African Animals
Interactive games
http://gamestolearnenglish.com/
http://www.vocabulary.co.il/english-language-games/
http://www.learninggamesforkids.com/vocabulary_games/foreign-languages.html
http://www.eslgamesplus.com/
http://www.abcya.com/
...
ANNOTATED CORPORA
LEMMATIZATION
am, are, is → be
car, cars, car's, cars' → car
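The mapping from word forms to a lemma shown above can be sketched as a simple lookup. This is only a minimal illustration: the table `LEMMA_TABLE` and the function `lemmatize` are hypothetical names, and the table is hand-written from the two examples on this slide. Real lemmatizers rely on large lexicons and morphological rules.

```python
# Toy lemmatizer: a hand-written lookup table built from the slide's examples.
# LEMMA_TABLE and lemmatize are illustrative names, not part of any library.
LEMMA_TABLE = {
    "am": "be", "are": "be", "is": "be",
    "cars": "car", "car's": "car", "cars'": "car",
}

def lemmatize(word_form: str) -> str:
    """Return the lemma of a word form, or the lower-cased form itself if unknown."""
    key = word_form.lower()
    return LEMMA_TABLE.get(key, key)

forms = ["am", "are", "is", "car", "cars", "car's", "cars'"]
lemmas = [lemmatize(form) for form in forms]
print(list(zip(forms, lemmas)))
```

Unknown forms are returned unchanged, which is why "car" maps to itself without needing an entry in the table.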
PART-OF-SPEECH TAGGING
Parts of speech
The morphological and syntactic classes to which
words can be assigned.
POS tagging
The automatic assignment of descriptors, called tags,
to input tokens.
THE TAGSET
The tagset includes all the tags that will be
used in the POS tagging.
We could use a very coarse tagset:
N, V, Adj, Adv, Prep...
The_DT
little_JJ
boy_NN1
quickly_RB
ate_VVD
the_DT
green_JJ
apple_NN1
./.
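A tagged sentence in the word_TAG notation above can be read into (word, tag) pairs and then projected onto a coarser tagset. The sketch below is an illustration under stated assumptions: `parse_tagged` is a hypothetical helper, and the `COARSE` mapping is made up for this example rather than taken from an official conversion table.

```python
# Read "word_TAG" items into (word, tag) pairs and map them to a coarse tagset.
# COARSE is an illustrative mapping only, not an official conversion table.
COARSE = {"DT": "Det", "JJ": "Adj", "NN1": "N", "RB": "Adv", "VVD": "V"}

def parse_tagged(text: str) -> list[tuple[str, str]]:
    pairs = []
    for item in text.split():
        # Punctuation is written "./." on the slide, words use "_" as separator.
        sep = "/" if "/" in item and "_" not in item else "_"
        # rpartition splits on the last separator only.
        word, _, tag = item.rpartition(sep)
        pairs.append((word, tag))
    return pairs

sentence = "The_DT little_JJ boy_NN1 quickly_RB ate_VVD the_DT green_JJ apple_NN1 ./."
pairs = parse_tagged(sentence)
# Tags without a coarse equivalent (here the full stop) are kept unchanged.
coarse = [(word, COARSE.get(tag, tag)) for word, tag in pairs]
print(coarse)
```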
CASES OF AMBIGUITY
They_PNP are_VBB flying_NN1_VBG planes_NN2 ./.
Time_NN1_VVB flies_VBZ_NNS like_PRP_VVB an_AT0 arrow_NN1 ./.
Coreference Resolution
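When a token carries more than one candidate tag, as in the examples above, the number of possible tag sequences grows multiplicatively. The following sketch enumerates them; `candidate_tags` and `readings` are hypothetical helper names, and the tags are copied from the slide as they appear.

```python
from itertools import product

def candidate_tags(item: str) -> tuple[str, list[str]]:
    """Split 'word_TAG1_TAG2' into the word and its list of candidate tags."""
    word, *tags = item.split("_")
    return word, tags

def readings(tagged: str) -> list[list[tuple[str, str]]]:
    """Enumerate every combination of candidate tags for a tagged sentence."""
    tokens = [candidate_tags(item) for item in tagged.split()]
    words = [word for word, _ in tokens]
    tag_options = [tags for _, tags in tokens]
    return [list(zip(words, combo)) for combo in product(*tag_options)]

sentence = "Time_NN1_VVB flies_VBZ_NNS like_PRP_VVB an_AT0 arrow_NN1"
all_readings = readings(sentence)
print(len(all_readings))  # 2 * 2 * 2 * 1 * 1 = 8 tag sequences
```

A tagger has to pick one of these sequences; the enumeration only shows how large the search space is even for a five-word sentence.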
Examples of Taggers and Parsers
Applications for formula reading
www.readspeaker.com
http://www.robobraille.org/robobraille-projects
http://www.inftyproject.org/en/index.html
http://lpf-esi.fe.up.pt/~audiomath/demo/AM1.htm
BRITISH NATIONAL CORPUS (BNC)
Results for collocates of black
Search for: Verb + Preposition
SEMANTICALLY-BASED QUERIES OF THE CORPUS
LEXICAL SEMANTIC NETWORKS
An example of classical taxonomy tree
adult child
[+adult] [-adult]
Lattice structure with multiple classifications
WORDNET - http://wordnet.princeton.edu/
WORDNET STRUCTURE
Nouns, verbs, adjectives and adverbs are grouped
into sets of cognitive synonyms (synsets), each
expressing a distinct concept.
Each synset is linked to other synsets by means of a
small number of “conceptual relations.”
WordNet really consists of four sub-nets, one each
for nouns, verbs, adjectives and adverbs, with few
cross-POS pointers.
WORDNET STRUCTURE
http://wordnet.princeton.edu/man/wngloss.7WN.html
Each synonym set (SYNSET) encodes the relation of
equivalence between a number of lexical items
(LITERALS), where each literal:
has a unique meaning (specified by the value of SENSE)
belongs to one and the same part of speech
(specified as the value of POS)
represents one and the same lexical meaning
(specified as the value of DEF, the definition)
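The structure described above can be pictured as a small record type. This is a sketch only: the class and field names (`Literal`, `Synset`, `pos`, `definition`) are hypothetical and do not reproduce the actual WordNet file format, and the gloss text is illustrative.

```python
from dataclasses import dataclass, field

@dataclass(frozen=True)
class Literal:
    word: str
    sense: int  # SENSE: sense number of this word within the synset

@dataclass
class Synset:
    pos: str         # POS: shared by every literal in the synset
    definition: str  # DEF: one definition for the whole synset
    literals: list[Literal] = field(default_factory=list)

    def __str__(self) -> str:
        inner = ", ".join(f"{lit.word}:{lit.sense}" for lit in self.literals)
        return "{" + inner + "}"

car = Synset(
    pos="n",
    definition="a motor vehicle with four wheels",  # illustrative gloss
    literals=[Literal("auto", 1), Literal("car", 2), Literal("automobile", 2),
              Literal("machine", 3), Literal("motorcar", 1)],
)
print(car)
```

Keeping POS and DEF on the synset rather than on each literal mirrors the requirement that all literals share one part of speech and one meaning.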
An example: learn (Wordnet)
BulNet http://dcl.bas.bg/bulnet/
A lexical semantic network of Bulgarian
comprises around 49,189 synonym sets
distributed into nine parts of speech
open-class words: nouns, verbs, adjectives and
adverbs
closed-class words: pronouns, prepositions,
conjunctions, particles and interjections
STRUCTURE
Each synset is linked to its counterpart in Princeton WordNet 3.0 (PWN 3.0) by
means of a unique identification number (ID).
The common synsets in the Balkan languages are
marked as common concepts subsets – BCS.
In the monolingual database, a synset should be linked to
at least one other synset through an intralingual
relation.
Non-obligatory information may also be encoded such
as examples of usage, stylistic, morphological or
syntactic properties.
RELATIONS IN BULNET
Synonymous sets are linked through various relations:
SEMANTIC
Synonymy, antonymy, hypernymy, hyponymy, meronymy,
holonymy, entailment, inclusion, causation, etc.
MORPHOSEMANTIC
BE IN STATE
MORPHOLOGICAL
DERIVED
PARTICIPLE
EXTRALINGUISTIC
SEMANTIC RELATIONS
SYNONYMY – a semantic relation of equivalence
between literals belonging to the same POS;
The synonyms form the synonym set also called
SYNSET.
For example:
The lexical units
{auto:1, car:2, automobile:2, machine:3, motorcar:1}
form a synset as they refer to the same concept.
SEMANTIC RELATIONS
HYPERNYMY and HYPONYMY – semantic relations between
synsets that correspond to the notion of class
inclusion: if W1 is a kind of W2, then W2 is a
hypernym of W1 and W1 is a hyponym of W2.
Example:
rose < plant < living organism
Multi-parent relations:
actress < actor
actress < female.
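Because hypernymy is a class-inclusion relation, hypernyms are inherited along the chain, and a word with several parents inherits from each of them. A minimal sketch, assuming the links from the examples above are stored in a plain dictionary (`HYPERNYMS`, `all_hypernyms` and `is_a` are hypothetical names):

```python
# Direct hypernym links; a word may have more than one parent.
HYPERNYMS = {
    "rose": ["plant"],
    "plant": ["living organism"],
    "actress": ["actor", "female"],
}

def all_hypernyms(word: str) -> set[str]:
    """Collect direct and inherited hypernyms by following the links upwards."""
    found: set[str] = set()
    stack = list(HYPERNYMS.get(word, []))
    while stack:
        current = stack.pop()
        if current not in found:
            found.add(current)
            stack.extend(HYPERNYMS.get(current, []))
    return found

def is_a(word: str, candidate: str) -> bool:
    """True if candidate is a (possibly indirect) hypernym of word."""
    return candidate in all_hypernyms(word)

print(all_hypernyms("rose"))
print(is_a("actress", "female"))
```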
SEMANTIC RELATIONS
ANTONYMY – a semantic relation of opposition,
established between two members belonging to
one and the same POS.
Examples:
man - woman
Hyponyms of two antonyms (nouns) should also be
antonymous pair by pair:
man - woman
actor - actress
SEMANTIC RELATIONS
MERONYMY and HOLONYMY – semantic relations linking
synsets denoting wholes with those denoting their
parts: if W1 has a W2, and W2 is a part, portion, or
member of W1, then W2 is a meronym of W1 and
W1 is a holonym of W2.
Examples:
...
MERONYMY may not always be reversible to HOLONYMY:
tree - forest
TYPES OF MERONYMY
PART OF:
branch – tree (клон – дърво)
book – library (книга – библиотека)
MEMBER OF:
tree – forest (дърво – гора)
footballer – team – league (футболист – отбор – лига)
PORTION OF:
drop – liquid (капка – течност)
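The three meronymy types can be stored as typed (part, relation, whole) triples and indexed in both directions. The sketch below uses English glosses of the Bulgarian examples above and reads the chain footballer, team, league as two MEMBER OF links, which is an assumption. `MERONYMY` and `build_indexes` are hypothetical names, and deriving the holonym index automatically is a simplification, since the relation is not always reversible in practice.

```python
from collections import defaultdict

# (part, relation, whole) triples; English glosses of the slide's examples.
MERONYMY = [
    ("branch", "PART OF", "tree"),
    ("tree", "MEMBER OF", "forest"),
    ("footballer", "MEMBER OF", "team"),
    ("team", "MEMBER OF", "league"),
    ("drop", "PORTION OF", "liquid"),
]

def build_indexes(triples):
    """Index the triples by whole (its meronyms) and by part (its holonyms)."""
    meronyms = defaultdict(list)  # whole -> [(relation, part)]
    holonyms = defaultdict(list)  # part  -> [(relation, whole)]
    for part, relation, whole in triples:
        meronyms[whole].append((relation, part))
        holonyms[part].append((relation, whole))
    return meronyms, holonyms

meronyms_of, holonyms_of = build_indexes(MERONYMY)
print(meronyms_of["tree"])  # parts of a tree
print(holonyms_of["tree"])  # wholes a tree belongs to
```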
WORD-FORMING RELATIONS
Morpho-semantic relations
BE IN STATE
bone – bony (кост – костен)
Morphological relations
DERIVED
to serve – waiter (сервирам – сервитьор)
PARTICIPLE
to see – seen (видя – видян)
EXTRALINGUISTIC RELATIONS
REGION DOMAIN
steppe – Russia (степ – Русия)
USAGE DOMAIN
suspenders – plural (тиранти – множествено число)
CATEGORY DOMAIN
gymnastic apparatus – gymnastics (гимнастически уред – гимнастика)
THE RELATIONS IN BULNET
APPLICATIONS
FRAMENET (Fillmore and Baker 2001, 2010)
A lexical database of English that is both human-
and machine-readable.
Based on annotated examples of how words are
used in actual texts.
Tries to capture human insight into how a word
can be used and converts it into semantic
knowledge that is machine-readable.
Available online at:
http://www.icsi.berkeley.edu/~framenet
FRAME SEMANTICS (Fillmore, 1976, 1985)
FrameNet Data: Frame Index
FrameGrapher
Uses of Electronic Language Resources
EDUCATION
Intelligent searches for particular language
phenomena, i.e. search by (combinations of) word
forms, grammatical tags, semantic relations;
Collocations;
Translation equivalents;
etc.
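A search by a combination of grammatical tags, such as a verb followed by a preposition, can be sketched over a tagged token list as follows. The helper `find_tag_sequences` is a hypothetical name, the sentence is hand-tagged for this illustration, and matching tags by prefix is an assumption that real corpus query tools handle with their own query syntax.

```python
def find_tag_sequences(tokens, pattern):
    """Return word sequences whose tags start with the given prefixes, in order."""
    hits = []
    n = len(pattern)
    for i in range(len(tokens) - n + 1):
        window = tokens[i:i + n]
        if all(tag.startswith(prefix) for (_, tag), prefix in zip(window, pattern)):
            hits.append(tuple(word for word, _ in window))
    return hits

# Hand-tagged toy sentence (illustrative, not taken from a real corpus).
tokens = [("She", "PNP"), ("looked", "VVD"), ("at", "PRP"), ("the", "AT0"),
          ("boy", "NN1"), ("and", "CJC"), ("waited", "VVD"), ("for", "PRP"),
          ("him", "PNP")]

# "VV" matches any lexical-verb tag in this toy tagging; "PRP" marks prepositions.
verb_prep = find_tag_sequences(tokens, ["VV", "PRP"])
print(verb_prep)
```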
References
THANK YOU
FOR YOUR ATTENTION!