
Dialogues between fictional personas will help chatbots hold a conversation about themselves


Researchers from Facebook have introduced Persona-Chat, a database of more than 160,000 dialogue utterances from real people who spoke on behalf of fictional personas. The developers also trained several computer models on this data. According to the preprint published on arXiv, the database could help give chatbots and voice assistants a personality.

Although voice assistants and chatbots improve every year, they still struggle to sustain a conversation. The reason lies in the limitations of the training data: a computer can be taught to talk and even to answer questions, but that may not be enough for a seamless conversation. A chatbot, for example, has no established personality or interests, and so cannot answer complex open-ended questions like “what do you like to do on a rainy day?”. Of course, a chatbot or voice assistant can respond to an unfamiliar question with a noncommittal phrase like “I do not know” or with a search query, but that can hardly be called a conversation. In addition, such an interlocutor keeps only a small snippet of the dialogue in memory and cannot return to information it was given a few turns earlier.

Developers from Facebook AI Research, led by Jason Weston, have collected a database of more than a thousand “personas”: short descriptions (five sentences of no more than 15 words each) of imaginary personalities invented by real people. A description of such a persona might look like this:

“I am a vegetarian. I like swimming. My father used to work for Ford. My favorite band is Maroon5. I got a new job last month, which is about advertising design.”
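The persona format described above (five sentences of at most 15 words each) can be sketched as a small data structure. This is only an illustration: the `Persona` class, its `problems` method, and the two constants are hypothetical names introduced here, not part of the Persona-Chat release, and words are counted by naive whitespace splitting.

```python
from dataclasses import dataclass

# Limits taken from the description in the article.
SENTENCES_PER_PERSONA = 5
MAX_WORDS_PER_SENTENCE = 15


@dataclass(frozen=True)
class Persona:
    """Hypothetical container for one persona: a tuple of short sentences."""
    sentences: tuple

    def problems(self):
        """Return a list of constraint violations (empty if the persona is valid)."""
        issues = []
        if len(self.sentences) != SENTENCES_PER_PERSONA:
            issues.append(
                f"expected {SENTENCES_PER_PERSONA} sentences, got {len(self.sentences)}"
            )
        for sentence in self.sentences:
            word_count = len(sentence.split())  # naive whitespace tokenization
            if word_count > MAX_WORDS_PER_SENTENCE:
                issues.append(f"sentence has {word_count} words: {sentence!r}")
        return issues


# The example persona quoted in the article.
example = Persona((
    "I am a vegetarian.",
    "I like swimming.",
    "My father used to work for Ford.",
    "My favorite band is Maroon5.",
    "I got a new job last month, which is about advertising design.",
))
```

Calling `example.problems()` on the quoted persona returns an empty list, since it has five sentences and none is longer than 15 words.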

In total, the database contains 1,155 such personas. The researchers also asked another group of people to review the resulting personas and rewrite them using similar, related characteristics: for example, a love of Maroon5 could be replaced with a love of the song “She Will Be Loved”, and the statement that the persona's father worked for Ford could become a statement that a close relative worked for a car company. This broadens the set of facts known about a persona so that a possible dialogue can be built around them.

After that, pairs of people who had provided personas were asked to chat briefly, each having been assigned a random persona. The resulting dialogue database contains 164,356 utterances, and the researchers used it to train several computer models (generative Seq2Seq and ranking Memory Networks). The resulting chatbot models were then evaluated in conversations with real people: the dialogue could be built around the computer's persona, the human's persona, or the personas of both participants. The new models outperformed a model trained on dialogues from film quotes in fluency, engagingness, and consistency of the conversation.
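To give a feel for what a ranking model does with a persona, here is a deliberately simple sketch. It is not the Memory Network or Seq2Seq model from the preprint: it is a toy bag-of-words ranker that scores each candidate reply by cosine similarity to the persona plus the last utterance. The function names, the two-sentence persona, and the candidate replies are all invented for illustration.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text and count its alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_candidates(persona, last_utterance, candidates):
    """Sort candidate replies by similarity to persona + last utterance, best first."""
    query = bag_of_words(" ".join(persona) + " " + last_utterance)
    return sorted(
        candidates,
        key=lambda reply: cosine(query, bag_of_words(reply)),
        reverse=True,
    )


# Hypothetical persona and candidate pool, for illustration only.
persona = ["I am a writer.", "I write romance novels."]
candidates = [
    "I do not know.",
    "I mainly write fiction novels.",
    "The weather is nice today.",
]
ranked = rank_candidates(persona, "Have you had anything published?", candidates)
print(ranked[0])
```

With this toy input, the reply that shares the most words with the persona ("I mainly write fiction novels.") is ranked first, and the unrelated remark about the weather is ranked last. A learned ranking model replaces the word-overlap score with a trained one, but the selection step has the same shape.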

An example of a dialogue between a person (PERSON 1) and a chatbot trained on the collected database (PERSON 2)

[PERSON 1:] I just got done playing soccer. What have you been up to?

[PERSON 2:] I do not have pets. I spend my free time writing. what about you?

[PERSON 1:] Just graduated from college. Looking for work.

[PERSON 2:] yes I graduated, in writing and poetry.

[PERSON 1:] Have you had anything published?

[PERSON 2:] I mainly write fiction novels.

[PERSON 1:] Do you like to watch sports?

[PERSON 2:] do you like kings of leon my favorite by them is use somebody

[PERSON 1:] Are you married? I will be married soon.

[PERSON 2:] haha, no time. I have got a novel to finish.

[PERSON 1:] What is your novel about?

[PERSON 2:] it is a love story. I really enjoy writing. do you write or read?

[PERSON 1:] Like to read. Spend most time watching football.

[PERSON 2:] yes I do I am and author for a living

The developers note that the collected database can be useful for building new chatbot and voice-assistant models and for improving existing ones. The database is also publicly available.

Holding conversations on abstract topics is more of an add-on for voice assistants and chatbots. Their main function, beyond controlling devices and recognizing voice commands, can even be psychological support, as with Woebot, which uses methods of cognitive behavioral therapy when talking to users.
