Dr. Olga Pelloni
Senior Research Engineer, NLP

Web development
Developing the website structure and design using WordPress and Elementor. Editing header images.
2024 University of Twente (remote project)
Project management, Flask web development
Project management for the scientific fair Scientifica 2021. Web development of a text generation app in four languages using GPT-2. Backend debugging and full frontend development.
Short interview about the project (in German):
2021 University of Zurich
Python package development
Contributing to the development of a Python package: designing the class architecture and cleaning up the code. Designing the logo.
2020 University of Zurich
Flask website, web development
Full stack development of a multimedia corpus.
Information about the project (Online publication of the Zurich Tangram Corpus)
Link to the corpus (available only from the UZH VPN)
2018–2019 University of Zurich
Django website + Neo4j database, web development
The Russian rhyme database is the first web resource for finding Russian rhymes with references to the actual verse lines from Russian poetry (from the 18th century to the first third of the 20th century). Full stack development.
2016 HSE University, Moscow
R Shiny application, web development
A game of guessing a Bayes factor (a metric from Bayesian statistics) given a scatter plot with regression lines.
2016 University of Tübingen
Crawling texts
Web crawling for a corpus of modern texts written in the Thai language.
2015–2016 HSE University, Moscow
Frontend development
2015 HSE University, Moscow
Frontend development
Research group “Corpus instruments for Yiddish studies”
Link to the paper
2014 HSE University, Moscow
Research on frequency of Russian verb forms
Freaky Frequency is an information system based on a collection of Russian word forms and their frequencies.
2013 HSE University, Moscow
Data visualization for my talk "Subword Geometry: Picturing Word Shapes" at the SIGTYP 2021 workshop, co-located with NAACL 2021.
2021 University of Zurich
Data visualization for the paper
Gutierrez-Vasques, X., C. Bentz, O. Sozinova and T. Samardzic (2021). From characters to words: the turning point of BPE merges.
In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 3454–3468.
2021 University of Zurich
JavaScript based on amCharts
Dynamic overview of the number of tokens gathered for different languages and genres in the TeDDi Sample (statistics as of 2020).
2020 University of Zurich
D3.js
Dynamic visualizations of Russian rhyme clusters. Links to the visualizations for different time periods:
18th century
19th century, 1st third
19th century, 2nd third
19th century, last third
20th century, 1st third
Related links:
Abstract for DH2016
Project description (in Russian)
2016 HSE University, Moscow, Russia