matlab_word2vec_binary_reader

バージョン 1.2 (6.15 KB) 作成者: Toru Ikegami

readW2Vbin - MATLAB utility to read binary word2vec embedding model file

https://github.com/mathworks/matlab_word2vec_binary_reader

フォロー

5.0

(2)

ダウンロード: 63

更新 2020/6/25

GitHub でライセンスを表示

Use `readW2Vbin` to read a pre-trained word2vec word embedding model in the binary format. It assumes that the file is written in the following format.

- The data before the first `0x20` (space) are ascii characters representing the number of vocabularies of the model , while the data between the first `0x20` and the first `0x10` (newline) represent the dimension of the word vector. (e.g.,`[ 51 48 48 48 48 48 48 32 51 48 48 10] ` means 3 milion words embedded into 300 dimensions. )
- The main body, which consists of sequence of word-vector pairs, begins right after the newline character. One word-vector pair consists of a sequence of bytes that represents a word, space (0x20), and a sequence of binary data that represents the embedded vector corresponding to the word in single precision (32bit) format. The length of the vector data is 4bytes times number of dimensions (e.g., 1200 bytes for 300 dimension).

This function was tested with the "GoogleNews-vectors-negative300.bin" from the word2vec web (https://code.google.com/archive/p/word2vec/). It took about a minute to read the 3.5GB file.

引用

Toru Ikegami (2026). matlab_word2vec_binary_reader (https://github.com/mathworks/matlab_word2vec_binary_reader/releases/tag/v1.2), GitHub. 取得日: 2026/3/18.

MATLAB リリースの互換性

作成: R2019b

R2019b 以降 R2020a 以前と互換性あり

プラットフォームの互換性

Windows macOS Linux

タグタグを追加

新しいタブで開く

バージョン	公開済み	リリースノート
1.2	2020/6/25	See release notes for this release on GitHub: https://github.com/mathworks/matlab_word2vec_binary_reader/releases/tag/v1.2	ダウンロード
1.1	2020/6/25	See release notes for this release on GitHub: https://github.com/mathworks/matlab_word2vec_binary_reader/releases/tag/v1.1	ダウンロード
1.0	2020/6/23		ダウンロード

この GitHub アドオンでの問題を表示または報告するには、GitHub リポジトリにアクセスしてください。

matlab_word2vec_binary_reader

引用

必須

MATLAB リリースの互換性

プラットフォームの互換性

タグタグを追加

ライブエディターを体験する

matlab_word2vec_bin​ary_reader

引用

必須

MATLAB リリースの互換性

プラットフォームの互換性

タグ タグを追加

ライブ エディターを体験する

matlab_word2vec_binary_reader

タグタグを追加

ライブエディターを体験する