Transactions of KarRC RAS :: Scientific publications
Transactions of KarRC RAS :: Scientific publications

Transactions of KarRC RAS :: Scientific publications
Karelian Research Centre of RAS
ISSN (print): 1997-3217
ISSN (online): 2312-4504
Transactions of KarRC RAS :: Scientific publications
Background Editorial committee Editorial Office For authors For reviewer Russian version
Transactions of KarRC RAS :: Scientific publications

Electronic Journal OJS



Series

Biogeography

Experimental Biology

Mathematical Modeling and Information Technologies

Precambrian Geology

Ecological Studies

Limnology and Oceanology

Research in the Humanities (2010-2015)

Region: Economy and Management (2012-2015)



Issues

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

1999-2008




SCIENTIFIC PUBLICATIONS
А.Н. Кириллов, А.А. Крижановский.
Модель геометрической структуры синсета
A.N. Kirillov, A.A. Krizhanovsky. Synset geometry structure model // Transactions of Karelian Research Centre of Russian Academy of Science. No 8. Mathematical Modeling and Information Technologies. 2016. Pp. 45-54
Keywords: synonym; synset; neural network; corpus linguistics; word2vec; RusVectores; gensim; Russian Wiktionary
The goal of formalization, proposed in this paper, is to bring together, as near as possible, the theoretic linguistic problem of synonym conception and the computer linguistic methods based generally on empirical intuitive unjustified factors. Using the word vector representation we have proposed the geometric approach to mathematical modeling of synset. The word embedding is based on the neural networks (Skip-gram, CBOW), developed and realized as word2vec program by T. Mikolov. The standard cosine similarity is used as the distance between word-vectors. Several geometric characteristics of the synset words are introduced: the interior of synset, the synset word rank and centrality. These notions are intended to select the most significant synset words, i.e. the words which senses are the nearest to the sense of a synset. Some experiments with proposed notions, based on RusVectores resources, are represented.
Indexed at RSCI


  Last modified: September 15, 2016