TextAnalyzer
https://github.com/sea-boat/TextAnalyzer
A text analyzer based on machine learning, statistics, and dictionaries. So far it supports hot word extraction, text classification, part-of-speech tagging, named entity recognition, Chinese word segmentation, address extraction, synonyms, text clustering, word2vec models, edit distance, sentence similarity, and word sentiment tendency.
Lightweight reactor network framework
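Of the features listed, edit distance is the easiest to illustrate in a few lines. The following is a minimal sketch of the classic Levenshtein dynamic program, not TextAnalyzer's own implementation; the class and method names (`EditDistanceSketch`, `levenshtein`) are hypothetical and chosen only for this example.

```java
public class EditDistanceSketch {

    // Levenshtein distance: the minimum number of single-character
    // insertions, deletions, and substitutions that turn a into b.
    // Hypothetical helper, not part of TextAnalyzer's API.
    static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];

        // Turning the empty prefix of a into the first j chars of b takes j insertions.
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }

        for (int i = 1; i <= a.length(); i++) {
            // Turning the first i chars of a into the empty string takes i deletions.
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                int insert = curr[j - 1] + 1;
                int delete = prev[j] + 1;
                int substitute = prev[j - 1] + cost;
                curr[j] = Math.min(Math.min(insert, delete), substitute);
            }
            // Reuse the two rows instead of allocating a full matrix.
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    public static void main(String[] args) {
        System.out.println(levenshtein("kitten", "sitting"));
    }
}
```

Keeping only two rows holds memory at O(|b|) while the running time stays O(|a| * |b|).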
https://github.com/sea-boat/net-reactor
Deep learning lab
https://github.com/sea-boat/DeepLearning-Lab
- rnn char test.
- seq2seq test.
- word2vec test.
- cross entropy test.
- bidirectional rnn test.
Machine learning lab
https://github.com/sea-boat/MachineLearning_Lab
- Least squares method
- Gradient Descent
- Newton's method
- hierarchy cluster
- knn
- markov
- adaboost
- random number generation (all kinds of distributions)
MySQL protocol
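To give one of the listed items a concrete shape, here is a minimal sketch of plain gradient descent on a one-dimensional function. It is an illustration under stated assumptions, not code from MachineLearning_Lab: the class name, the `minimize` helper, the learning rate, and the step count are all hypothetical.

```java
import java.util.function.DoubleUnaryOperator;

public class GradientDescentSketch {

    // Repeatedly step against the gradient: x <- x - lr * f'(x).
    // Hypothetical helper; a fixed learning rate and step count are assumed.
    static double minimize(DoubleUnaryOperator gradient, double start,
                           double learningRate, int steps) {
        double x = start;
        for (int i = 0; i < steps; i++) {
            x -= learningRate * gradient.applyAsDouble(x);
        }
        return x;
    }

    public static void main(String[] args) {
        // f(x) = (x - 3)^2 has gradient 2 * (x - 3) and its minimum at x = 3.
        double x = minimize(v -> 2 * (v - 3), 0.0, 0.1, 200);
        System.out.println("approximate minimizer: " + x);
    }
}
```

With this quadratic and a learning rate of 0.1, each step shrinks the distance to the minimum by a factor of 0.8, so the iterate converges toward 3; a learning rate that is too large would make it diverge instead.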
https://github.com/sea-boat/mysql-protocol
A library for MySQL protocol operations.
How to use
To build an OK packet, fill in the fields as below; bytes then holds the encoded packet.
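import java.nio.ByteBuffer;
// OKPacket comes from the mysql-protocol library.

OKPacket ok = new OKPacket();
ok.packetId = 2;        // sequence id of this packet
ok.affectedRows = 0;
ok.insertId = 0;
ok.serverStatus = 2;    // in the MySQL protocol, 0x0002 is the autocommit flag
ok.warningCount = 0;

// Serialize the packet into a buffer, then copy out only the bytes written.
ByteBuffer buffer = ByteBuffer.allocate(256);
ok.write(buffer);
buffer.flip();          // switch the buffer from writing to reading
byte[] bytes = new byte[buffer.remaining()];
buffer.get(bytes, 0, bytes.length);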
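For readers curious what such bytes look like on the wire, the sketch below hand-encodes an OK packet following the MySQL protocol 4.1 layout: a 3-byte little-endian payload length, a 1-byte sequence id, then the 0x00 header, affected rows and last insert id as length-encoded integers, and 2-byte status flags and warning count. It does not use the mysql-protocol library, and whether `OKPacket.write` emits exactly these bytes is an assumption; `OkPacketSketch` and its methods are hypothetical names.

```java
import java.io.ByteArrayOutputStream;

public class OkPacketSketch {

    // Write the low n bytes of v in little-endian order.
    static void writeLittleEndian(ByteArrayOutputStream out, long v, int n) {
        for (int i = 0; i < n; i++) {
            out.write((int) ((v >>> (8 * i)) & 0xFF));
        }
    }

    // MySQL length-encoded integer (non-negative values assumed).
    static void writeLengthEncoded(ByteArrayOutputStream out, long v) {
        if (v < 251) {
            out.write((int) v);
        } else if (v < (1L << 16)) {
            out.write(0xFC);
            writeLittleEndian(out, v, 2);
        } else if (v < (1L << 24)) {
            out.write(0xFD);
            writeLittleEndian(out, v, 3);
        } else {
            out.write(0xFE);
            writeLittleEndian(out, v, 8);
        }
    }

    // Hypothetical encoder for a protocol 4.1 OK packet without an info string.
    static byte[] encodeOk(int sequenceId, long affectedRows, long lastInsertId,
                           int statusFlags, int warnings) {
        ByteArrayOutputStream payload = new ByteArrayOutputStream();
        payload.write(0x00);                        // OK header
        writeLengthEncoded(payload, affectedRows);
        writeLengthEncoded(payload, lastInsertId);
        writeLittleEndian(payload, statusFlags, 2);
        writeLittleEndian(payload, warnings, 2);
        byte[] body = payload.toByteArray();

        ByteArrayOutputStream packet = new ByteArrayOutputStream();
        writeLittleEndian(packet, body.length, 3);  // payload length
        packet.write(sequenceId & 0xFF);            // sequence id
        packet.write(body, 0, body.length);
        return packet.toByteArray();
    }

    public static void main(String[] args) {
        // Same field values as the usage example above.
        byte[] bytes = encodeOk(2, 0, 0, 2, 0);
        StringBuilder hex = new StringBuilder();
        for (byte b : bytes) {
            hex.append(String.format("%02x ", b & 0xFF));
        }
        System.out.println(hex.toString().trim());
    }
}
```

With the field values from the example, the payload is 7 bytes long and the whole packet is 11 bytes.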
seq2seq model chatbot
https://github.com/sea-boat/seq2seq_chatbot
A chatbot implemented with a seq2seq model.
How to train
- Run data.py to produce the files needed for training.
- Run train.py to train the model.
- Run test_model.py to predict.
Requirements
- Python 3.5
- TensorFlow 1.3
AIML chatbot
https://github.com/sea-boat/super-robot
A robot based on Alice Bot that can chat with people and handle business tasks.
Features
- Alice Bot for chatting.
- Customizable corpus via AIML 1.0.
- Learning chat online.
- Ability Bot interface for implementing all kinds of abilities.
- Service Bot for doing business.
TODO
- Support AIML 2.0.
Natural language processing lab
https://github.com/sea-boat/nlp_lab
- doc2txt
- tf-idf
- cnn text classify
- hmm cws
- crf ner