EMNLP 2020 | Open-Domain Generative Dialogue Based on Counterfactual Reasoning

Published by HIT-SCIR on 2020-11-04

Original URL: https://www.jiqizhixin.com/articles/2020-11-04-6

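Paper title: Counterfactual Off-Policy Training for Neural Dialogue Generation
Paper authors: Qingfu Zhu, Weinan Zhang, Ting Liu, William Yang Wang
Original author: Qingfu Zhu
Paper link: https://arxiv.org/abs/2004.14507
Reposts must credit the source: HIT-SCIR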

1. Introduction

2. Model Architecture

2.1 Structural Causal Model

2.2 Intervention

2.3 Counterfactual Inference

3. Experimental Results

4. Experimental Analysis

5. Conclusion

