how to prepare data for domain specific chat-bot

up vote
1
down vote

favorite

I am trying to make a chatbot. all the chatbots are made of structure data. I looked Rasa, IBM watson and other famous bots. Is there any ways that we can convert the un-structured data into some sort of structure, which can be used for bot training? Let's consider bellow paragraph-

Packaging unit
A packaging unit is used to combine a certain quantity of identical items to form a group. The quantity specified here is then used when printing the item labels so that you do not have to label items individually when the items are not managed by serial number or by batch. You can also specify the dimensions of the packaging unit here and enable and disable them separately for each item.

It is possible to store several EAN numbers per packaging unit since these numbers may differ for each packaging unit even when the packaging units are identical. These settings can be found on the Miscellaneous tab: There are also two more settings in the system settings that are relevant to mobile data entry:

When creating a new item, the item label should be printed automatically. For this reason, we have added the option ‘Print item label when creating new storage locations’ to the settings. When using mobile data entry devices, every item should be assigned to a storage location, where an item label is subsequently printed that should be applied to the shelf in the warehouse to help identify the item faster.

how to make the bot from such a data any lead would be highly appreciated. Thanks!
is this idea in picture will work?just_a_thought

edited 2 days ago

asked Nov 6 at 12:21

Niraj D Pandey

112

add a comment |

up vote
1
down vote

favorite

how to make the bot from such a data any lead would be highly appreciated. Thanks!
is this idea in picture will work?just_a_thought

edited 2 days ago

asked Nov 6 at 12:21

Niraj D Pandey

112

add a comment |

up vote
1
down vote

favorite

how to make the bot from such a data any lead would be highly appreciated. Thanks!
is this idea in picture will work?just_a_thought

edited 2 days ago

asked Nov 6 at 12:21

Niraj D Pandey

112

how to make the bot from such a data any lead would be highly appreciated. Thanks!
is this idea in picture will work?just_a_thought

ibm-watson chatterbot rasa-core chatfuel

edited 2 days ago

asked Nov 6 at 12:21

Niraj D Pandey

112

edited 2 days ago

asked Nov 6 at 12:21

Niraj D Pandey

112

edited 2 days ago

asked Nov 6 at 12:21

Niraj D Pandey

112

asked Nov 6 at 12:21

Niraj D Pandey

112

asked Nov 6 at 12:21

Niraj D Pandey

112

add a comment |

1 Answer
1

active

oldest

votes

up vote
1
down vote

The data you are showing seems to be a good candidate for a passage search. Basically, you would like to answer user question by the most relevant paragraph found in your training data. This uses-case is handled by Watson Discovery service that can analyze unstructured data as you are providing and then you can query the service with input text and the service answers with the closest passage found in the data.

From my experience you also get a good results by implementing your own custom TF/IDF algorithm tailored for your use-case (TF/IDF is a nice similarity search tackling e.g. the stopwords for you).

Now if your goal would be to bootstrap a rule based chatbot using these kind of data then these data are not that ideal. For rule-based chatbot the best data would be some actual conversations between users asking questions about the target domain and the answers by some subject matter expert. Using these data you might be able to at least do some analysis helping you to pinpoint the relevant topics and domains the chatbot should handle however - I think - you will have hard time using these data to bootstrap a set of intents (questions the users will ask) for the rule based chatbot.

TLDR
If I would like to use Watson service, I would start with Watson Discovery. Alternatively, I would implement my own search algorithm starting with TF/IDF (which maps rather nicely to your proposed solution).

answered Nov 6 at 13:21

Michal Bida

922322

add a comment |

Your Answer

StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);

);

draft saved

draft discarded

StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53171803%2fhow-to-prepare-data-for-domain-specific-chat-bot%23new-answer', 'question_page');

);

Post as a guest

Name

1 Answer
1

active

oldest

votes

1 Answer
1

active

oldest

votes

up vote
1
down vote

From my experience you also get a good results by implementing your own custom TF/IDF algorithm tailored for your use-case (TF/IDF is a nice similarity search tackling e.g. the stopwords for you).

answered Nov 6 at 13:21

Michal Bida

922322

add a comment |

up vote
1
down vote

From my experience you also get a good results by implementing your own custom TF/IDF algorithm tailored for your use-case (TF/IDF is a nice similarity search tackling e.g. the stopwords for you).

answered Nov 6 at 13:21

Michal Bida

922322

add a comment |

up vote
1
down vote

From my experience you also get a good results by implementing your own custom TF/IDF algorithm tailored for your use-case (TF/IDF is a nice similarity search tackling e.g. the stopwords for you).

answered Nov 6 at 13:21

Michal Bida

922322

From my experience you also get a good results by implementing your own custom TF/IDF algorithm tailored for your use-case (TF/IDF is a nice similarity search tackling e.g. the stopwords for you).

answered Nov 6 at 13:21

Michal Bida

922322

answered Nov 6 at 13:21

Michal Bida

922322

answered Nov 6 at 13:21

Michal Bida

922322

answered Nov 6 at 13:21

Michal Bida

922322

add a comment |

draft saved

draft discarded

draft saved

draft discarded

Post as a guest

Name

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Dfyjkt