Statistical Dialogue Modelling

M. Gasić.
Department of Engineering, University of Cambridge, Cambridge, UK, PhD thesis, (January 2011)

Abstract

The partially observable Markov decision process (POMDP) has been proposed as a model for dialogue which is able to provide increased robustness to errors in understanding of speech, automatically optimise dialogue management behaviour and be amenable to adaptation for different user types. The POMDP-based approach to dialogue management maintains a distribution over every possible dialogue state, the belief state. Based on that distribution the system chooses the action that gives the highest expected reward, where the reward provides a measure of how good the dialogue is. The primary challenge, however, with the POMDP-based approach is the intractability of both maintaining the belief state and of optimising action selection. The Hidden Information State framework is a practical framework for building dialogue managers based on the POMDP approach. It achieves tractability by grouping the possible user goals into equivalence classes which then ensures that the belief state can be maintained tractably. It optimises the dialogue policy in a much reduced belief state space, the summary space. In this thesis, a more efficient state representation is presented which includes the representation of logical complements of concepts in the user request. On the one hand, the representation supports more complex dialogues that include logical expressions. On the other hand, it enables a pruning technique to be implemented which is able to place a bound on the space. Thus, no limit is required on the length of the dialogue or on the number of different hypotheses that are received from the speech understanding module. More importantly, this enables building real-world dialogue systems with large domains. This thesis also examines the potential for improving the action selection. Firstly, the problem of optimising action selection in the summary space is examined. A method is then proposed that guarantees selection of optimal back-off actions in the case when the selected action cannot be mapped back to the original belief state space. Secondly, this thesis investigates the use of Gaussian processes to approximate the highest expected reward that can be obtained for every belief state and system action. Approximating the function with a Gaussian process provides a posterior distribution of the function values given the prior distribution and some observations. It is shown here that an adequate prior speeds up the optimisation of action selection. The posterior also provides an estimate of the uncertainty, which enables rapid adaptation to different user profiles. Overall, the methods proposed in this thesis make steps towards more flexible real-world spoken dialogue systems.
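
As a rough illustration of the two ideas the abstract opens with (maintaining a belief state and choosing the action with the highest expected reward), the following is a minimal sketch for a toy problem with two hidden user goals. It is not the thesis's Hidden Information State implementation: the function names, the matrices and the three actions are all hypothetical, and action selection here weights per-state action values by the belief, which is a common simplification rather than an exact POMDP solution.

```python
import numpy as np

def belief_update(belief, transition, obs_likelihood):
    """One Bayes-filter step over a discrete state set.

    belief[s]          : current probability of hidden state s
    transition[s, s2]  : P(s2 | s, a) for the action just taken (assumed given)
    obs_likelihood[s2] : P(o | s2) for the observation just received (assumed given)
    """
    predicted = transition.T @ belief           # sum over s of P(s2 | s, a) * b(s)
    unnormalised = obs_likelihood * predicted   # weight by how well s2 explains o
    return unnormalised / unnormalised.sum()    # renormalise to a distribution

def greedy_action(belief, action_values):
    """Index of the action with the highest belief-weighted value.

    action_values[a, s] is a hypothetical value of taking action a in state s.
    """
    return int(np.argmax(action_values @ belief))

# Toy setup (all numbers invented for illustration).
prior = np.array([0.5, 0.5])                 # two possible user goals
stay = np.eye(2)                             # the user goal does not change
heard_goal_0 = np.array([0.8, 0.2])          # noisy evidence favouring goal 0
values = np.array([[1.0, -1.0],              # action 0: act on goal 0
                   [-1.0, 1.0],              # action 1: act on goal 1
                   [0.2, 0.2]])              # action 2: ask for confirmation

posterior = belief_update(prior, stay, heard_goal_0)
```

With the uniform prior the sketch prefers the confirmation action, and after the observation it prefers acting on goal 0, which mirrors the robustness to understanding errors that the abstract attributes to the belief-state approach.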

BibTeX key: Gasic11Phd
entry type: phdthesis
address: Cambridge, UK
year: 2011
month: jan
school: Department of Engineering, University of Cambridge
type: PhD thesis
file: Author home page:2011/Gasic11Phd.pdf:PDF
groups: public
intrahash: c196be4b02930ca4226d33306b9caaac
timestamp: 2011.01.01
username: flint63
Document: http://mi.eng.cam.ac.uk/~mg436/papers/gasic-thesis-printing.pdf

Cite this publication

%0 Thesis
%1 Gasic11Phd
%A Gasić, Milica
%C Cambridge, UK
%D 2011
%K 01801 103 book numerical ai user interaction dialog language processing learn zzz.sds
%T Statistical Dialogue Modelling
%U http://mi.eng.cam.ac.uk/~mg436/papers/gasic-thesis-printing.pdf

@phdthesis{Gasic11Phd,
  added-at = {2018-03-21T09:31:48.000+0100},
  address = {Cambridge, UK},
  author = {Ga{\v{s}}i{\'c}, Milica},
  biburl = {https://www.bibsonomy.org/bibtex/2c196be4b02930ca4226d33306b9caaac/flint63},
  file = {Author home page:2011/Gasic11Phd.pdf:PDF},
  groups = {public},
  interhash = {cc78f4a7a7fa448d9564489166994b04},
  intrahash = {c196be4b02930ca4226d33306b9caaac},
  keywords = {01801 103 book numerical ai user interaction dialog language processing learn zzz.sds},
  month = jan,
  school = {Department of Engineering, University of Cambridge},
  timestamp = {2018-04-16T12:26:10.000+0200},
  title = {Statistical Dialogue Modelling},
  type = {PhD thesis},
  url = {http://mi.eng.cam.ac.uk/~mg436/papers/gasic-thesis-printing.pdf},
  username = {flint63},
  year = 2011
}
