Taiwan TOCFL 2023 wordlist with audio (Traditional)

Language

Complete wordlist of TOCFL (Test of Chinese as a Foreign Language), a taiwanese equivalent of HSK.

Parsed from official excel sheets from TOCFL website. This is a new 2022/2023 version (8000zhuyin_202307.zip) of the list with 7517 entries (previous 2018 list had 7945 entries.)

Columns:

  • ID: term's level + index (row number in original excel file which has one sheet per level):
    • L0-1nnn = Novice 1 (準備級一級), L0-2nnn = Novice 2 (準備級二級), both pre-A1, L1-nnnn..L5-nnnn = Level 1..5 (入門級/基礎級/進階級/高階級/流利級) = CEFR A1/A2/B1/B2/C1+..
    • Levels are also added as tags.
  • Traditional: term in traditional characters per TOCFL.
  • Simplified: term converted to simplified characters.
  • Pinyin: pinyin with diacritics, slightly cleaned up from TOCFL sheets, e.g. missing apostrophes added and a few clear errors corrected. Tone changes are not indicated.
  • POS: part of speech, /-separated. See description on TOCFL website for the meaning of abbreviations (202204 list is essentially same)
  • Meaning: definitions from CC-CEDICT for convenience. Note it mainly lists mainland pronunciations which may differ from taiwanese in some cases. CC-BY-SA 4.0 licensed.
  • Audio: good quality neural TTS audio with a taiwanese mandarin voice.
  • Variants: for entries where TOCFL gives multiple variants of a term, an expanded disambiguated list as a JSON list of objects with alternatives column values. If using this deck for an automatic analysis (such as merging with other sources or your anki decks), you might find this field useful as the original source is inconsistent in formatting variants.

Card Previews

No previews available
0 Cards
0 Likes
0 Downloads