# FERNET-C5 **Repository Path**: modelee/FERNET-C5 ## Basic Information - **Project Name**: FERNET-C5 - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 3 - **Forks**: 0 - **Created**: 2023-05-24 - **Last Updated**: 2025-06-20 ## Categories & Tags **Categories**: llm **Tags**: None ## README --- language: "cs" tags: - Czech - KKY - FAV license: "cc-by-nc-sa-4.0" --- # FERNET-C5 FERNET-C5 (**F**lexible **E**mbedding **R**epresentation **NET**work) is a monolingual Czech BERT-base model pre-trained from 93GB of Czech Colossal Clean Crawled Corpus (C5). See our paper for details. ## Paper https://link.springer.com/chapter/10.1007/978-3-030-89579-2_3 The preprint of our paper is available at https://arxiv.org/abs/2107.10042. ## Citation If you find this model useful, please cite our paper: ``` @inproceedings{FERNETC5, title = {Comparison of Czech Transformers on Text Classification Tasks}, author = {Lehe{\v{c}}ka, Jan and {\v{S}}vec, Jan}, year = 2021, booktitle = {Statistical Language and Speech Processing}, publisher = {Springer International Publishing}, address = {Cham}, pages = {27--37}, doi = {10.1007/978-3-030-89579-2_3}, isbn = {978-3-030-89579-2}, editor = {Espinosa-Anke, Luis and Mart{\'i}n-Vide, Carlos and Spasi{\'{c}}, Irena} } ```