RASA(一)

machine_learner發表於2020-09-27

原文網址 : https://blog.csdn.net/Jae_q/article/details/108825492

layout: post
title: rasa_chatbot
subtitle:
date: 2020-7-24
author: RJ
header-img:
catalog: true
tags:
- NLP

環境配置

pip install -n rasa python==3.7
pip3 install rasa==2.0.0rc1
conda install pytorch==1.5.1 torchvision==0.6.1 cudatoolkit=10.1 -c pytorch
pip install "rasa[transformers]"

三種model

Chinese_models_for_SpaCy models

pip install https://github.com/explosion/spacy-models/releases//tag/zh_core_web_lg-2.3.1/zh_core_web_lg-2.3.1.tar.gz

[MITIE]

[HFTransformer] pip install “rasa[transformers]”

智慧客服機器人

[外鏈圖片轉存失敗,源站可能有防盜鏈機制,建議將圖片儲存下來直接上傳(img-btiiabny-1601180411302)(https://raw.githubusercontent.com/rejae/rejae.github.io/master/img/15964501511980.png)]

Rasa研究使用

Create a New Project
View Your NLU Training Data
Define Your Model Configuration
Write Your First Stories
Define a Domain
Train a Model
Test Your Assistant
Talk to Your Assistant

rasa init --no-prompt    （1. Create a New Project， if you use --no-prompt you will get a default project, or you could use rasa train latter）

cat data/nlu.md  (nlu.md is your train data)
cat config.yml
cat data/stories.md
cat domain.yml

rasa train
rasa test
rasa shell

Core

Stories
Domains

Responses
Actions
Reminders and External Events
Policies
Slots
Forms
Retrieval Actions
Interactive Learning
Fallback Actions
Knowledge Base Actions

Stories.md and nlu.md

data/nlu.md ‘*’ your NLU training data
data/stories.md ‘*’ your stories

1、Stories

Rasa stories are a form of training data used to train the Rasa’s dialogue management models.

A story is a representation of a conversation between a user and an AI assistant, converted into a specific format where user inputs are expressed as corresponding intents (and entities where necessary) while the responses of an assistant are expressed as corresponding action names.

##	story 標題

>> checkpoint
* intent
  - action
  - xxx_form
  - form{"name":"xxx_form"}

## story_happy
>> activate restaurant form
    - ...
* request_restaurant
    - restaurant_form
    - form{"name": "restaurant_form"}

把以上內容儲存到 stories.md檔案中

呼叫執行：

rasa train core -d domain.yml -s data/stories.md --out models -c config.yml

2、 訓練資料 nlu.md

[training-data-format](https://rasa.com/docs/rasa/nlu/training-data-format/)

The training data for Rasa NLU is structured into different parts:

- common examples  (required) 
- synonyms  
    (Synonyms will map extracted entities to the same name, for example mapping “my savings account” to simply “savings”. However, this only happens after the entities have been extracted, so you need to provide examples with the synonyms present so that Rasa can learn to pick them up)
- regex features 
    (Regex features are a tool to help the classifier detect entities or intents and improve the performance.)
- lookup tables 
    (Lookup tables may be specified as plain text files containing newline-separated words or phrases. Upon loading the training data, these files are used to generate case-insensitive regex patterns that are added to the regex features.)


synonyms: use the new format [savings account]{"entity": "source_account", "value": "savings"}

Domain.yml

domain可以理解為機器的知識庫，其中定義了意圖，動作，以及對應動作所反饋的內容。

其中槽位和實體重合度較高

intents	意圖
actions	動作
templates	回答模板
entities	實體
slots	詞槽

config.yml

language: zh
pipeline:
  - name: HFTransformersNLP
    model_name: "bert"
    model_weights: "bert-base-chinese"
    cache_dir: "D:\\model_files"
  - name: LanguageModelTokenizer
  - name: LanguageModelFeaturizer
  - name: LexicalSyntacticFeaturizer
  - name: CRFEntityExtractor
  - name: EntitySynonymMapper
  - name: DIETClassifier
    epochs: 200

policies:
  - name: "rasa.core.policies.ted_policy.TEDPolicy"
    epochs: 120
    featurizer:
      - name: MaxHistoryTrackerFeaturizer
        max_history: 5
        state_featurizer:
          - name: BinarySingleStateFeaturizer
  - name: "rasa.core.policies.memoization.MemoizationPolicy"
    max_history: 5
  - name: "rasa.core.policies.form_policy.FormPolicy"
  - name: "rasa.core.policies.mapping_policy.MappingPolicy"
  - name: "rasa.core.policies.fallback.FallbackPolicy"
    nlu_threshold: 0.4
    core_threshold: 0.3
    ambiguity_threshold: 0.05
    fallback_action_name: 'action_fallback'

# Configuration for Rasa NLU.
# https://rasa.com/docs/rasa/nlu/components/
language: zh
pipeline:
  - name: MitieNLP
    model: data/total_word_feature_extractor_zh.dat
  - name: JiebaTokenizer
    dictionary_path: D:/rasa_workspace/.rasa_p2/data/dict/user_dict.txt
  - name: MitieEntityExtractor
  - name: EntitySynonymMapper
  - name: RegexFeaturizer
  - name: MitieFeaturizer
  - name: SklearnIntentClassifier

policies:
  - name: MemoizationPolicy
  - name: TEDPolicy
    max_history: 5
    epochs: 100
  - name: MappingPolicy

reference

使用 Rasa NLU 構建一箇中文 ChatBot

基於RASA的task-orient對話系統解析（一）

基於RASA的task-orient對話系統解析（二）——對話管理核心模組

基於RASA的task-orient對話系統解析（三）——基於rasa的會議室預定對話系統例項

參考

基於中文的醫療知識圖譜的問答機器人MedicalKBQA

windows下安裝MITIE

git clone https://github.com/mit-nlp/MITIE.git
cd MITIE
python setup.py install

rasa csdn blog

rasa Bert

rasa 如何寫一個故事
2021-09-01
Rasa 聊天機器人專欄（下）
2020-02-13
機器人
Rasa 聊天機器人專欄（上）
2020-02-07
機器人
Rasa init報錯：AttributeError: type object 'Callable' has no attribute '_abc_registry'
2020-05-19
ErrorObject
Rasa中使用lookup table時針對中文對RegexEntityExtractor進行修改
2020-11-17
rasa form的中斷形式自然機器語言學習人工智慧
2021-09-02
ORM人工智慧
每日一練(一)
2018-03-29
一筆一劃教你寫一簽名
2020-11-22
一次一密
2020-07-19
一條唯一索引
2019-06-21
索引
一步一步實現一個Promise
2019-04-28
Promise
一、JVM專欄之一
2019-04-18
JVM
Mysql 一主一從配置
2024-12-04
MySql
一比一還原axios原始碼（一）—— 發起第一個請求
2022-03-15
iOS原始碼
MyBatis 使用resultMap 以及一對一和一對多
2022-12-01
MyBatis
一杯茶,一支菸,一行程式碼寫一天 !
2019-01-18
行程
一對一直播原始碼助力一對一教育，進入直播3.0時代！
2018-09-27
原始碼
一步一步來
2019-01-06
一對一聊天ajax實現
2018-06-11
L1-030 一幫一
2024-03-10
弘一法師語錄一
2024-03-31
一、一加9刷入LineageOS
2024-06-10
一步一步帶你掌握webpack(一）——入門
2019-04-15
Web
一天一個設計模式(一) - 總體概述
2018-07-13
設計模式
promise原理—一步一步實現一個promise
2019-04-27
Promise
自由職業一時爽，一直自由一直爽
2019-05-15
JPA(3) 表關聯關係(多對一、一對多、多對多、一對一)
2018-11-15
為什麼反向關聯一對一和一對多都是同一個方法
2019-12-09
什麼是一對一直播原始碼?一對一直播為何產生？
2019-03-23
原始碼
excel表格複製貼上格式怎麼能一模一樣表格怎麼複製一個一模一樣的
2022-03-19
Excel
華為雲FusionInsight MRS：助力企業構建“一企一湖，一城一湖”
2020-11-04
gorm 關係一對一,一對多,多對多查詢
2019-12-30
GoORM
一顆燈一束光一片葉很寧靜
2021-06-22
【董天一】什麼是IPFS?(一)
2018-11-14
Cherry-Pick | 一日一 Git
2019-03-04
Git
一個人像一家公司
2019-01-16
（一）你的第一個Socket程式
2018-09-23
Java聊天室——一對一模式
2018-06-03
Java模式