
Deep Learning Course 5, Week 2: Natural Language Processing & Word Embeddings Quiz Notes

Natural Language Processing & Word Embeddings

  1. True/False: Suppose you learn a word embedding for a vocabulary of 20000 words. Then the embedding vectors could be 1000 dimensional, so as to capture the full range of variation and meaning in those words.
Explanation: The dimension of word vectors is usually much smaller than the size of the vocabulary. The most common sizes for word vectors range between 50 and 1000.
  2. True/False: t-SNE is a linear transformation that allows us to solve analogies on word vectors.
Explanation: t-SNE is a non-linear dimensionality reduction technique.
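Since t-SNE comes up here, below is a minimal sketch of projecting word vectors to 2-D with scikit-learn's TSNE; the vocabulary and the embedding matrix are made-up placeholders.

```python
# Minimal sketch: projecting word vectors to 2-D with t-SNE (a non-linear method).
# The vocabulary and 50-d embeddings are placeholders; scikit-learn is required.
import numpy as np
from sklearn.manifold import TSNE

rng = np.random.default_rng(0)
words = ["king", "queen", "man", "woman", "apple"]   # placeholder vocabulary
E = rng.normal(size=(len(words), 50))                # placeholder 50-d embeddings

# perplexity must be smaller than the number of samples; the resulting 2-D
# coordinates are NOT a linear function of the original vectors.
coords = TSNE(n_components=2, perplexity=2, init="random", random_state=0).fit_transform(E)
for w, (x, y) in zip(words, coords):
    print(f"{w:>6}: ({x:6.2f}, {y:6.2f})")
```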
  3. True/False: Suppose you download a pre-trained word embedding which has been trained on a huge corpus of text. You then use this word embedding to train an RNN for a language task of recognizing if someone is happy from a short snippet of text, using a small training set.
Then even if the word “ecstatic” does not appear in your small training set, your RNN might reasonably be expected to recognize “I’m ecstatic” as deserving a label y = 1.
Explanation: Word vectors give your model a strong ability to generalize. The vector for “ecstatic” carries a positive/happy connotation, which will probably lead your model to classify the sentence as a “1” (the embeddings improve generalization).
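As a rough illustration of this generalization effect, the sketch below compares a hypothetical “ecstatic” vector against “happy” and “sad” by cosine similarity; the vectors are invented for the example.

```python
# Rough illustration (made-up vectors): a pre-trained embedding places
# "ecstatic" near "happy", so a model trained only on "happy" examples
# can still handle "ecstatic" at test time.
import numpy as np

def cosine(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

# Hypothetical 4-d embeddings; real ones are 50-1000 dimensional.
emb = {
    "happy":    np.array([ 0.9, 0.1, 0.3, 0.0]),
    "ecstatic": np.array([ 0.8, 0.2, 0.4, 0.1]),
    "sad":      np.array([-0.7, 0.0, 0.2, 0.1]),
}

print("happy vs ecstatic:", cosine(emb["happy"], emb["ecstatic"]))  # high similarity
print("happy vs sad:     ", cosine(emb["happy"], emb["sad"]))       # low / negative
```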
  4. Which of these equations do you think should hold for a good word embedding? (Check all that apply)
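One family of equations that good embeddings are expected to satisfy is the analogy relation, e.g. e_man − e_woman ≈ e_king − e_queen. The sketch below checks this on toy vectors constructed so the relation holds exactly; real embeddings only satisfy it approximately.

```python
# Sketch of the analogy property good embeddings should satisfy, e.g.
# e_man - e_woman ≈ e_king - e_queen. The vectors here are toy values.
import numpy as np

emb = {
    "man":   np.array([0.2, 0.9, 0.1]),
    "woman": np.array([0.2, 0.1, 0.1]),
    "king":  np.array([0.9, 0.9, 0.8]),
    "queen": np.array([0.9, 0.1, 0.8]),
}

diff1 = emb["man"] - emb["woman"]
diff2 = emb["king"] - emb["queen"]
print(np.allclose(diff1, diff2))  # True: the two difference vectors match
```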
  5. Let E be an embedding matrix, and let o_4567 be a one-hot vector corresponding to word 4567. Then to get the embedding of word 4567, why don’t we call E * o_4567 in Python?
  • The correct formula is E^T * o_4567.
Explanation: The element-wise multiplication will be extremely inefficient, since almost every entry of the one-hot vector is zero; in practice the embedding is read out directly as the corresponding column of E.
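A small numpy sketch of why the one-hot product is wasteful compared with a direct column lookup; the shapes follow the course convention of E being n_e × vocab_size, and the numbers are illustrative.

```python
# Sketch: extracting an embedding via a one-hot product vs. direct indexing.
import numpy as np

vocab_size, n_e = 10000, 500
rng = np.random.default_rng(0)
E = rng.normal(size=(n_e, vocab_size))      # embedding matrix

o_4567 = np.zeros(vocab_size)
o_4567[4567] = 1.0                          # one-hot vector for word 4567

via_matmul = E @ o_4567                     # n_e * vocab_size multiply-adds, almost all with 0
via_lookup = E[:, 4567]                     # just reads one column

print(np.allclose(via_matmul, via_lookup))  # True: same result, far less work for the lookup
```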
  6. When learning word embeddings, words are automatically generated along with the surrounding words.
Explanation: We pick a given word and try to predict its surrounding words, or vice versa.
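A short sketch of sampling skip-gram style (context, target) pairs from a sentence, i.e. picking a word and predicting words within a small window around it; the sentence and window size are only examples.

```python
# Sketch: building (context, target) training pairs the skip-gram way.
sentence = "the quick brown fox jumps over the lazy dog".split()
window = 2

pairs = []
for i, context in enumerate(sentence):
    for j in range(max(0, i - window), min(len(sentence), i + window + 1)):
        if j != i:
            pairs.append((context, sentence[j]))   # (context word, nearby target word)

print(pairs[:6])
# [('the', 'quick'), ('the', 'brown'), ('quick', 'the'), ('quick', 'brown'), ...]
```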
  7. In the word2vec algorithm, you estimate P(t | c), where t is the target word and c is a context word. How are t and c chosen? Choose the best answer.
  • c is the sequence of all the words in the sentence before t.
  • c and t are chosen to be nearby words.
  • c is a sequence of several words immediately before t.
  • c is the one word that comes immediately before t.
  8. Suppose you have a 10000 word vocabulary, and are learning 500-dimensional word embeddings. The word2vec model uses the following softmax function:
$$P(t \mid c) = \frac{e^{\theta_t^{T} e_c}}{\sum_{t'=1}^{10000} e^{\theta_{t'}^{T} e_c}}$$
Which of these statements are correct? Check all that apply. (A numerical sketch of this softmax appears after the options.)
  • θ_t and e_c are both 500-dimensional vectors.
  • θ_t and e_c are both 10000-dimensional vectors.
  • After training, we should expect θ_t to be very close to e_c when t and c are the same word.
  • θ_t and e_c are both trained with an optimization algorithm such as Adam or gradient descent.
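The sketch below evaluates this softmax with the stated shapes (10000-word vocabulary, 500-dimensional θ_t and e_c); the parameters are random placeholders rather than trained values.

```python
# Sketch of the word2vec softmax P(t | c) = exp(theta_t^T e_c) / sum_t' exp(theta_t'^T e_c)
# with the shapes from the question. Theta and E are random placeholders.
import numpy as np

vocab_size, n_e = 10000, 500
rng = np.random.default_rng(0)
Theta = rng.normal(scale=0.01, size=(vocab_size, n_e))   # one theta_t per target word
E = rng.normal(scale=0.01, size=(vocab_size, n_e))       # one e_c per context word

def p_target_given_context(t, c):
    logits = Theta @ E[c]                 # theta_t^T e_c for every candidate target t'
    logits -= logits.max()                # stabilise the exponentials
    probs = np.exp(logits) / np.exp(logits).sum()
    return probs[t]

print(p_target_given_context(t=42, c=4567))   # roughly 1/10000 for random parameters
```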
  9. Suppose you have a 10000 word vocabulary, and are learning 500-dimensional word embeddings. The GloVe model minimizes this objective:
$$\min \sum_{i=1}^{10000} \sum_{j=1}^{10000} f(X_{ij}) \left(\theta_i^{T} e_j + b_i + b'_j - \log X_{ij}\right)^2$$
Which of these statements are correct? Check all that apply. (A sketch of evaluating this objective appears after the options.)
  • θ_i and e_j should be initialized to 0 at the beginning of training.
  • θ_i and e_j should be initialized randomly at the beginning of training.
  • Theoretically, the weighting function f(.) must satisfy f(0) = 0.
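Below is a numpy sketch of evaluating this objective on a toy co-occurrence matrix, using the standard GloVe weighting f(x) = min((x/x_max)^0.75, 1) so that f(0) = 0; all sizes and values are illustrative.

```python
# Sketch: evaluating the GloVe objective
#   sum_ij f(X_ij) * (theta_i . e_j + b_i + b'_j - log X_ij)^2
# on a toy co-occurrence matrix. f(0) = 0, so pairs that never co-occur
# contribute nothing and log(0) is never evaluated.
import numpy as np

vocab_size, n_e, x_max = 6, 4, 100.0
rng = np.random.default_rng(0)

X = rng.integers(0, 5, size=(vocab_size, vocab_size)).astype(float)  # toy co-occurrence counts
Theta = rng.normal(scale=0.1, size=(vocab_size, n_e))   # theta_i, initialized randomly
E = rng.normal(scale=0.1, size=(vocab_size, n_e))       # e_j, initialized randomly
b = np.zeros(vocab_size)                                 # b_i
b_prime = np.zeros(vocab_size)                           # b'_j

def f(x):
    # Standard GloVe weighting: 0 at x = 0, grows as (x / x_max)^0.75, capped at 1.
    return np.where(x > 0, np.minimum((x / x_max) ** 0.75, 1.0), 0.0)

def glove_objective():
    total = 0.0
    for i in range(vocab_size):
        for j in range(vocab_size):
            if X[i, j] > 0:                               # f(0) = 0, so skip zero counts
                err = Theta[i] @ E[j] + b[i] + b_prime[j] - np.log(X[i, j])
                total += f(X[i, j]) * err ** 2
    return total

print(glove_objective())
```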
  10. You have trained word embeddings using a text dataset of m_1 words. You are considering using these word embeddings for a language task, for which you have a separate labeled dataset of m_2 words. Keeping in mind that using word embeddings is a form of transfer learning, under which of these circumstances would you expect the word embeddings to be helpful?
  • When m_1 is equal to m_2.
  • When m_1 is smaller than m_2.
  • When m_1 is larger than m_2.
