Suppose you have been given a "black box" function A that will take any two words (assume I mean tokens) and return a statistical association score for that pair of words, where a higher score means the two words tend to be associated with each other. (I'm deliberately leaving that notion vague.) If you had a large set of French-English sentence translations (i.e. parallel text, e.g. Canadian Parliament proceedings in French and English), how would you use those translations plus black-box function A to build a French-English bilingual dictionary?
And here's the way I meant to say it.
Suppose I give you a technique that will take a large set of word
(token) pairs of the form
The second version is more interesting, but feel free to answer either. The main point is to get you thinking about ways to use statistical association measures, because we're going to cover that idea next class.