PhD thesis,

Statistical Dialogue Modelling

Department of Engineering, University of Cambridge, Cambridge, UK, PhD thesis (January 2011)

Abstract

The partially observable Markov decision process (POMDP) has been proposed as a model for dialogue that provides increased robustness to speech understanding errors, allows dialogue management behaviour to be optimised automatically, and is amenable to adaptation to different user types. The POMDP-based approach to dialogue management maintains a distribution over every possible dialogue state, the belief state. Based on that distribution, the system chooses the action that gives the highest expected reward, where the reward provides a measure of how good the dialogue is. The primary challenge with the POMDP-based approach, however, is that both maintaining the belief state and optimising action selection are intractable in general. The Hidden Information State framework is a practical framework for building dialogue managers based on the POMDP approach. It keeps belief monitoring tractable by grouping the possible user goals into equivalence classes, and it optimises the dialogue policy in a much reduced belief state space, the summary space.

In this thesis, a more efficient state representation is presented which includes the representation of logical complements of concepts in the user request. On the one hand, the representation supports more complex dialogues that include logical expressions. On the other hand, it enables a pruning technique which places a bound on the size of the state space, so that no limit is required on the length of the dialogue or on the number of hypotheses received from the speech understanding module. More importantly, this makes it possible to build real-world dialogue systems with large domains.

This thesis also examines the potential for improving action selection. Firstly, the problem of optimising action selection in the summary space is examined, and a method is proposed that guarantees the selection of optimal back-off actions when the selected action cannot be mapped back to the original belief state space. Secondly, the thesis investigates the use of Gaussian processes to approximate the highest expected reward that can be obtained for each belief state and system action. Approximating this function with a Gaussian process provides a posterior distribution over the function values given a prior distribution and a set of observations. It is shown here that an adequate prior speeds up the optimisation of action selection, and that the posterior's estimate of uncertainty enables rapid adaptation to different user profiles. Overall, the methods proposed in this thesis take steps towards more flexible real-world spoken dialogue systems.
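As a rough illustration of the two mechanics the abstract describes, maintaining a belief state and choosing the action with the highest expected reward, the sketch below shows a generic discrete POMDP update in Python. It is not the thesis's implementation (the thesis works with grouped user goals and a summary space); all names, shapes, and the toy distributions here are illustrative assumptions.

```python
# Minimal sketch of POMDP belief tracking and expected-reward action selection.
# T, O, Q, states, and actions are illustrative assumptions, not from the thesis.
import numpy as np

def update_belief(b, a, o, T, O):
    """Bayesian belief update: b'(s') proportional to O[a, s', o] * sum_s T[a, s, s'] * b(s)."""
    predicted = T[a].T @ b               # predictive distribution over next states
    unnormalised = O[a, :, o] * predicted
    return unnormalised / unnormalised.sum()

def greedy_action(b, Q):
    """Pick the action whose expected value under the belief state is highest."""
    # Q[a, s] stands in for the long-term reward of taking action a in state s;
    # Q @ b gives the expectation of that value under the current belief.
    return int(np.argmax(Q @ b))

# Toy example: 3 hidden dialogue states, 2 system actions, 2 observations.
rng = np.random.default_rng(0)
T = rng.dirichlet(np.ones(3), size=(2, 3))   # T[a, s, s'] transition probabilities
O = rng.dirichlet(np.ones(2), size=(2, 3))   # O[a, s', o] observation probabilities
Q = rng.normal(size=(2, 3))                  # stand-in action-value estimates
b = np.ones(3) / 3                           # uniform initial belief

b = update_belief(b, a=0, o=1, T=T, O=O)
print("belief:", b, "chosen action:", greedy_action(b, Q))
```

In a real system the exact update above is intractable for large state spaces, which is exactly the motivation for the equivalence-class grouping and summary-space optimisation described in the abstract.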

Users

  • @flint63
