KSAT: Corpus Christi officer arrested on theft charges in Kendall County, placed on administrative leave

KENDALL COUNTY, Texas – A Corpus Christi police officer was accused and arrested on three theft-related charges after investigators learned he used a credit card connected to a police nonprofit ...

Corpus Christi officer arrested on theft charges in Kendall County, placed on administrative leave

kristv: Arrest records detail witness account in fatal collision that killed 14-Year-Old

I would read in the BCC corpus frequency list as a dictionary, then Having concatenated all the news/magazine articles as plain text, I would build a dictionary of all the words in the news/magazine articles up to 8 characters long, counting their number of occurrences with the help of the BCC frequency list (which tells us which combinations ...

Word frequency list based on a 15 billion character corpus: BCC (BLCU ...

I guess in my case, I could go with per-corpus flashcard sets to keep the per-corpus tagging, and one user dictionary (without tags) with all the per-corpus ranking info included in one entry per term.

The BCC corpus seems to have pretty loose licensing terms. Pleco already seems to be using frequency data to sort the search results. Adding them meaningfully to dictionary definitions would be even better, I believe. That is something which printed dictionaries can’t do.

The Beijing Language and Culture University created a balanced corpus of 15 billion characters. It’s based on news (人民日报 1946-2018,人民日报海外版 2000-2018), literature (books by 472 authors, including a significant portion of non-Chinese writers), non-fiction books, blog and weibo entries as well as...