distributed_skipgram_mixture

Distributed skipgram mixture model for multisense word embedding

  • 所有者: microsoft/distributed_skipgram_mixture
  • 平台:
  • 許可證: MIT License
  • 分類:
  • 主題:
  • 喜歡:
    0
      比較:

Github星跟蹤圖

Distributed Multisense Word Embedding

The Distributed Multisense Word Embedding(DMWE) tool is a parallelization of the Skip-Gram Mixture [1] algorithm on top of the DMTK parameter server. It provides an efficient "scaling to industry size" solution for multi sense word embedding.

For more details, please view our website http://www.dmtk.io

Download

$ git clone https://github.com/Microsoft/distributed_skipgram_mixture

Build

Prerequisite

DMWE is built on top of the DMTK parameter sever, therefore please download and build DMTK first (https://github.com/Microsoft/multiverso).

For Windows

Open windows\distributed_skipgram_mixture\distributed_skipgram_mixture.sln using Visual Studio 2013. Add the necessary include path (for example, the path for DMTK multiverso) and lib path. Then build the solution.

For Ubuntu (Tested on Ubuntu 12.04)

Download and build by running $ sh build.sh. Modify the include and lib path in Makefile. Then run $ make all -j4.

Run

For parameter settings, see scripts/parameters_settings.txt. For running it, see the example script scripts/run.py.

Reference

[1] Tian, F., Dai, H., Bian, J., Gao, B., Zhang, R., Chen, E., & Liu, T. Y. (2014). A probabilistic model for learning multi-prototype word embeddings. In Proceedings of COLING (pp. 151-160).

Microsoft Open Source Code of Conduct

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

概覽

名稱與所有者microsoft/distributed_skipgram_mixture
主編程語言C++
編程語言Makefile (語言數: 5)
平台
許可證MIT License
發布數0
創建於2015-10-14 02:58:05
推送於2016-06-23 03:29:52
最后一次提交2016-06-23 11:29:52
星數115
關注者數31
派生數50
提交數49
已啟用問題?
問題數5
打開的問題數5
拉請求數1
打開的拉請求數0
關閉的拉請求數1
已啟用Wiki?
已存檔?
是復刻?
已鎖定?
是鏡像?
是私有?
去到頂部