Search Results for author: Arkadiusz Janz

Found 16 papers, 2 papers with code

Propagation of emotions, arousal and polarity in WordNet using Heterogeneous Structured Synset Embeddings

no code implementations • GWC 2019 • Jan Kocoń, Arkadiusz Janz

In this paper we present a novel method for emotive propagation in a wordnet based on a large emotive seed.

regression

Paper
Add Code

Testing Zipf’s meaning-frequency law with wordnets as sense inventories

no code implementations • GWC 2019 • Francis Bond, Arkadiusz Janz, Marek Maziarz, Ewa Rudnicka

According to George K. Zipf, more frequent words have more senses.

LEMMA

Paper
Add Code

A Comparison of Sense-level Sentiment Scores

no code implementations • GWC 2019 • Francis Bond, Arkadiusz Janz, Maciej Piasecki

In this paper, we compare a variety of sense-tagged sentiment resources, including SentiWordNet, ML-Senticon, plWordNet emo and the NTU Multilingual Corpus.

Paper
Add Code

Discriminating Homonymy from Polysemy in Wordnets: English, Spanish and Polish Nouns

no code implementations • EACL (GWC) 2021 • Arkadiusz Janz, Marek Maziarz

We propose a novel method of homonymy-polysemy discrimination for three Indo-European Languages (English, Spanish and Polish).

LEMMA regression

Paper
Add Code

Neural Language Models vs Wordnet-based Semantically Enriched Representation in CST Relation Recognition

no code implementations • EACL (GWC) 2021 • Arkadiusz Janz, Maciej Piasecki, Piotr Wątorski

Neural language models, including transformer-based models, that are pre-trained on very large corpora became a common way to represent text in various tasks, including recognition of textual semantic relations, e. g. Cross-document Structure Theory.

Relation Sentence

Paper
Add Code

Wordnet-based Evaluation of Large Distributional Models for Polish

no code implementations • GWC 2018 • Maciej Piasecki, Gabriela Czachor, Arkadiusz Janz, Dominik Kaszewski, Paweł Kędzia

The paper presents construction of large scale test datasets for word embeddings on the basis of a very large wordnet.

Word Embeddings

Paper
Add Code

Recognition of Hyponymy and Meronymy Relations in Word Embeddings for Polish

no code implementations • GWC 2018 • Gabriela Czachor, Maciej Piasecki, Arkadiusz Janz

Word embeddings were used for the extraction of hyponymy relation in several approaches, but also it was recently shown that they should not work, in fact.

regression Word Embeddings

Paper
Add Code

Context-sensitive Sentiment Propagation in WordNet

no code implementations • GWC 2018 • Jan Kocoń, Arkadiusz Janz, Maciej Piasecki

In this paper we present a comprehensive overview of recent methods of the sentiment propagation in a wordnet.

Paper
Add Code

Personalized Large Language Models

no code implementations • 14 Feb 2024 • Stanisław Woźniak, Bartłomiej Koptyra, Arkadiusz Janz, Przemysław Kazienko, Jan Kocoń

Large language models (LLMs) have significantly advanced Natural Language Processing (NLP) tasks in recent years.

Emotion Recognition Hate Speech Detection +1

Paper
Add Code

BEIR-PL: Zero Shot Information Retrieval Benchmark for the Polish Language

no code implementations • 31 May 2023 • Konrad Wojtasik, Vadim Shishkin, Kacper Wołowiec, Arkadiusz Janz, Maciej Piasecki

In this work, inspired by mMARCO and Mr.~TyDi datasets, we translated all accessible open IR datasets into Polish, and we introduced the BEIR-PL benchmark -- a new benchmark which comprises 13 datasets, facilitating further development, training and evaluation of modern Polish language models for IR tasks.

Information Retrieval Re-Ranking +1

Paper
Add Code

ChatGPT: Jack of all trades, master of none

1 code implementation • 21 Feb 2023 • Jan Kocoń, Igor Cichecki, Oliwier Kaszyca, Mateusz Kochanek, Dominika Szydło, Joanna Baran, Julita Bielaniewicz, Marcin Gruza, Arkadiusz Janz, Kamil Kanclerz, Anna Kocoń, Bartłomiej Koptyra, Wiktoria Mieleszczenko-Kowszewicz, Piotr Miłkowski, Marcin Oleksy, Maciej Piasecki, Łukasz Radliński, Konrad Wojtasik, Stanisław Woźniak, Przemysław Kazienko

Our comparison of its results with available State-of-the-Art (SOTA) solutions showed that the average loss in quality of the ChatGPT model was about 25% for zero-shot and few-shot evaluation.

Chatbot Emotion Recognition +6

Paper
Code

This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish

1 code implementation • 23 Nov 2022 • Łukasz Augustyniak, Kamil Tagowski, Albert Sawczyn, Denis Janiak, Roman Bartusiak, Adrian Szymczak, Marcin Wątroba, Arkadiusz Janz, Piotr Szymański, Mikołaj Morzy, Tomasz Kajdanowicz, Maciej Piasecki

In this paper, we introduce LEPISZCZE (the Polish word for glew, the Middle English predecessor of glue), a new, comprehensive benchmark for Polish NLP with a large variety of tasks and high-quality operationalization of the benchmark.

Benchmarking