📝Clint's Notes

Search

❯

artificial intelligence

❯

natural language processing

❯

❯

Skip-Gram

Jun 06, 20251 min read

predicting the context given a word **w_t
Let wt-1,…,wt-m, wt+1,…,wt+m be the context
Pr(w_t | context) * Pr (context) = Pr(context | w_t) * Pr(w_t)
Pr(context) and Pr(w_t) are uniform distributions and are constants
Pr(context | w_t) = Product { Pr(w_j | w_t) } for all js

Word2Vec is a skip-gram model

Graph View

Backlinks

Word2Vec

Created with Quartz v4.2.3 © 2025

GitHub
Discord Community