My open-source projects roundup (machine & deep learning, NLP, network I/O, AIML, MySQL protocol, chatbot)

Posted by 超人汪小建 on 2018-07-12

TextAnalyzer

https://github.com/sea-boat/TextAnalyzer

A text analyzer based on machine learning, statistics, and dictionaries. So far it supports hot word extraction, text classification, part-of-speech tagging, named entity recognition, Chinese word segmentation, address extraction, synonyms, text clustering, a word2vec model, edit distance, sentence similarity, and word sentiment tendency.
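To make one of the listed features concrete, here is a minimal sketch of edit distance, assuming the classic Levenshtein definition (insertions, deletions, and substitutions each cost 1). The class and method names are hypothetical and are not taken from TextAnalyzer's API; the project may implement or expose this differently.

```java
// Hypothetical sketch, not TextAnalyzer's actual API.
class EditDistanceSketch {

    // Returns the minimum number of single-character insertions,
    // deletions, and substitutions needed to turn a into b.
    static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        // Turning an empty prefix of a into the first j chars of b takes j insertions.
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // deleting the first i chars of a
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                int insertOrDelete = Math.min(curr[j - 1] + 1, prev[j] + 1);
                curr[j] = Math.min(insertOrDelete, prev[j - 1] + cost);
            }
            // Reuse the two rows instead of allocating a full matrix.
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    public static void main(String[] args) {
        System.out.println(levenshtein("kitten", "sitting")); // prints 3
    }
}
```

The comparison works on UTF-16 code units, which is sufficient for most Chinese text but would need code-point handling for characters outside the Basic Multilingual Plane.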

Lightweight reactor network framework

https://github.com/sea-boat/net-reactor


Deep learning lab

https://github.com/sea-boat/DeepLearning-Lab

  • RNN char test.
  • seq2seq test.
  • word2vec test.
  • Cross entropy test.
  • Bidirectional RNN test.
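As an illustration of what a cross entropy test exercises, the following is a minimal sketch of softmax followed by cross-entropy loss for a single example. It assumes the usual definition, loss = -ln(p[target]), and uses hypothetical names; it is not code from DeepLearning-Lab.

```java
// Hypothetical sketch, not code from the DeepLearning-Lab repository.
class CrossEntropySketch {

    // Converts raw scores into probabilities that sum to 1.
    // Subtracting the maximum first avoids overflow in Math.exp.
    static double[] softmax(double[] logits) {
        double max = Double.NEGATIVE_INFINITY;
        for (double v : logits) {
            max = Math.max(max, v);
        }
        double sum = 0.0;
        double[] probs = new double[logits.length];
        for (int i = 0; i < logits.length; i++) {
            probs[i] = Math.exp(logits[i] - max);
            sum += probs[i];
        }
        for (int i = 0; i < probs.length; i++) {
            probs[i] /= sum;
        }
        return probs;
    }

    // Cross-entropy loss for one example whose true class index is target.
    static double crossEntropy(double[] logits, int target) {
        return -Math.log(softmax(logits)[target]);
    }

    public static void main(String[] args) {
        // A confident correct prediction gives a smaller loss than a wrong one.
        System.out.println(crossEntropy(new double[] {5.0, 0.0, 0.0}, 0));
        System.out.println(crossEntropy(new double[] {5.0, 0.0, 0.0}, 1));
    }
}
```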

Machine learning lab

https://github.com/sea-boat/MachineLearning_Lab

  • Least squares method
  • Gradient descent
  • Newton's method
  • Hierarchical clustering
  • kNN
  • Markov
  • AdaBoost
  • Random number generation (various distributions)
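Two of the items above combine naturally: a least-squares line fit solved by gradient descent. The sketch below assumes a mean-squared-error loss for the model y = w*x + b, a fixed learning rate, and hypothetical names; it is not taken from MachineLearning_Lab.

```java
// Hypothetical sketch, not code from the MachineLearning_Lab repository.
class GradientDescentSketch {

    // Fits y = w * x + b by minimizing mean squared error with
    // batch gradient descent. Returns {w, b}.
    static double[] fitLine(double[] x, double[] y, double learningRate, int iterations) {
        double w = 0.0;
        double b = 0.0;
        int n = x.length;
        for (int it = 0; it < iterations; it++) {
            double gradW = 0.0;
            double gradB = 0.0;
            for (int i = 0; i < n; i++) {
                double error = w * x[i] + b - y[i];
                gradW += 2.0 * error * x[i] / n; // d(loss)/dw
                gradB += 2.0 * error / n;        // d(loss)/db
            }
            w -= learningRate * gradW;
            b -= learningRate * gradB;
        }
        return new double[] {w, b};
    }

    public static void main(String[] args) {
        // Points that lie exactly on y = 2x + 1.
        double[] x = {0, 1, 2, 3, 4};
        double[] y = {1, 3, 5, 7, 9};
        double[] params = fitLine(x, y, 0.05, 5000);
        System.out.println("w = " + params[0] + ", b = " + params[1]);
    }
}
```

The learning rate of 0.05 is chosen for this small data set; a larger step can make the iteration diverge, so the value would need tuning for other inputs.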

MySQL protocol

https://github.com/sea-boat/mysql-protocol

A library for MySQL protocol operations.

How to use

To build an OK packet, the code below is all you need; the resulting bytes array holds the encoded packet.

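// ByteBuffer is java.nio.ByteBuffer; OKPacket comes from the mysql-protocol library.
OKPacket ok = new OKPacket();
ok.packetId = 2;       // packet sequence id
ok.affectedRows = 0;
ok.insertId = 0;
ok.serverStatus = 2;   // 0x0002 = SERVER_STATUS_AUTOCOMMIT
ok.warningCount = 0;
ByteBuffer buffer = ByteBuffer.allocate(256);
ok.write(buffer);      // serialize the packet into the buffer
buffer.flip();         // switch the buffer from writing to reading
byte[] bytes = new byte[buffer.remaining()];
buffer.get(bytes, 0, bytes.length);  // copy out the encoded bytes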
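For readers curious what such a packet looks like on the wire, here is a hand-rolled sketch that encodes the same field values without the library. It assumes the protocol 4.1 OK packet layout (3-byte little-endian payload length, 1-byte sequence id, 0x00 header, length-encoded affected rows and insert id, 2-byte status, 2-byte warning count) and only handles values below 251, where a length-encoded integer is a single byte. The names are hypothetical, and the library's own output may differ, for example if it appends a message field.

```java
// Hypothetical sketch, not the mysql-protocol library's implementation.
class OkPacketSketch {

    // Encodes a minimal OK packet. affectedRows and insertId must be
    // below 251 so that each fits in a one-byte length-encoded integer.
    static byte[] encodeOk(int sequenceId, int affectedRows, int insertId,
                           int status, int warnings) {
        if (affectedRows < 0 || affectedRows >= 251 || insertId < 0 || insertId >= 251) {
            throw new IllegalArgumentException("sketch only supports values 0..250");
        }
        byte[] payload = {
            0x00,                            // OK packet header
            (byte) affectedRows,             // length-encoded integer (1 byte)
            (byte) insertId,                 // length-encoded integer (1 byte)
            (byte) (status & 0xff),          // status flags, little-endian
            (byte) ((status >> 8) & 0xff),
            (byte) (warnings & 0xff),        // warning count, little-endian
            (byte) ((warnings >> 8) & 0xff)
        };
        byte[] packet = new byte[4 + payload.length];
        packet[0] = (byte) (payload.length & 0xff);          // payload length,
        packet[1] = (byte) ((payload.length >> 8) & 0xff);   // 3 bytes,
        packet[2] = (byte) ((payload.length >> 16) & 0xff);  // little-endian
        packet[3] = (byte) sequenceId;
        System.arraycopy(payload, 0, packet, 4, payload.length);
        return packet;
    }

    public static void main(String[] args) {
        StringBuilder hex = new StringBuilder();
        for (byte b : encodeOk(2, 0, 0, 2, 0)) {
            hex.append(String.format("%02x ", b));
        }
        System.out.println(hex.toString().trim()); // prints 07 00 00 02 00 00 00 02 00 00 00
    }
}
```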

seq2seq model chatbot

https://github.com/sea-boat/seq2seq_chatbot

A chatbot implemented with a seq2seq model.

How to train

  1. Run data.py to generate the files needed for training.
  2. Run train.py to train the model.
  3. Run test_model.py to make predictions.

Requirements

  • Python 3.5
  • TensorFlow 1.3

AIML chatbot

https://github.com/sea-boat/super-robot

A robot based on Alice Bot that can chat with people and handle business tasks.

Features

  • Alice Bot for chatting.
  • Custom corpus via AIML 1.0.
  • Online learning of chat responses.
  • Ability Bot interface for implementing all kinds of abilities.
  • Service Bot for handling business tasks.

TODO

  • Support AIML 2.0.

Natural language processing lab

https://github.com/sea-boat/nlp_lab

  • doc2txt
  • TF-IDF
  • CNN text classification
  • HMM Chinese word segmentation (CWS)
  • CRF named entity recognition (NER)
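As a small illustration of the TF-IDF item, the sketch below scores a term in one document against a tiny corpus. It assumes already-tokenized documents (Chinese text would need word segmentation first), term frequency as count divided by document length, and inverse document frequency as ln(N / df) without smoothing. The names are hypothetical and not taken from nlp_lab, which may use a different weighting variant.

```java
// Hypothetical sketch, not code from the nlp_lab repository.
class TfIdfSketch {

    // Term frequency: occurrences of term divided by document length.
    static double tf(String term, String[] doc) {
        if (doc.length == 0) {
            return 0.0;
        }
        int count = 0;
        for (String word : doc) {
            if (word.equals(term)) {
                count++;
            }
        }
        return (double) count / doc.length;
    }

    // Inverse document frequency: ln(N / df), where df is the number
    // of documents containing the term. Returns 0 if no document has it.
    static double idf(String term, String[][] corpus) {
        int df = 0;
        for (String[] doc : corpus) {
            if (tf(term, doc) > 0.0) {
                df++;
            }
        }
        return df == 0 ? 0.0 : Math.log((double) corpus.length / df);
    }

    static double tfIdf(String term, String[] doc, String[][] corpus) {
        return tf(term, doc) * idf(term, corpus);
    }

    public static void main(String[] args) {
        String[][] corpus = {
            {"the", "cat", "sat"},
            {"the", "dog", "ran"}
        };
        // "the" appears in every document, so its score is 0;
        // "cat" is specific to the first document and scores higher.
        System.out.println(tfIdf("the", corpus[0], corpus));
        System.out.println(tfIdf("cat", corpus[0], corpus));
    }
}
```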


