Satori-Punctum  Search Word Frequency List

"Satori" and "Punctum" Fiction Corpus

The corpus consists of fiction (both original and translated works) published in the internet periodicals "Satori" and "Punctum".

Corpus size 3.5M words (4.3M tokens)
Data period 2003–2025
Development period 2025
Developers Institute of Mathematics and Computer Science UL
Funding EU Recovery and Resilience Facility "Language Technology Initiative" (2.3.1.1.i.0/1/22/I/CFLA/002); State Research Programme "Digital Humanities" (VPP-IZM-DH-2022/1-0002)