salesforce
diff --git a/README.md b/README.md
Lines changed: 16 additions & 16 deletions
diff --git a/db/botsim_sqlite_demo.db b/db/botsim_sqlite_demo.db
20 KB
diff --git a/docs/BotSIM_Performance_Report.png b/docs/BotSIM_Performance_Report.png
-1.74 MB
diff --git a/docs/BotSIM_App.png b/docs/_static/BotSIM_App.png
docs/BotSIM_App.png renamed to docs/_static/BotSIM_App.png
diff --git a/docs/advanced_usage.rst b/docs/advanced_usage.rst
Lines changed: 4 additions & 4 deletions
diff --git a/docs/dashboard.rst b/docs/dashboard.rst
Lines changed: 15 additions & 8 deletions
@@ -18,7 +18,7 @@
 <a href="https://arxiv.org/abs/2211.11982">Technical Report</a>,
 <a href="https://salesforce-botsim.herokuapp.com/">Demo</a>,
 <a href="https://opensource.salesforce.com/botsim//latest/index.html">Documentation</a>,
-<a href="">Blog</a>
+<a href="https://">Blog</a>
 </div>
 
 
@@ -35,8 +35,8 @@
 
 
 ## Introduction
-BotSIM is a Bot SIMulation toolkit for performing large-scale data-efficient end-to-end evaluation, diagnosis and remediation of commercial task-oriented dialog (TOD) systems to accelerate bot development and evaluation, reduce cost and time-to-market.
-As a modular framework, BotSIM can be extended by bot developers to support new bot platforms. As a toolkit, it offers an easy-to-use App and a suite of command line tools for bot admins or practitioners to readily perform evaluation and remediation of their bots.
+BotSIM is a Bot SIMulation toolkit for performing large-scale data-efficient end-to-end evaluation, diagnosis and remediation of commercial task-oriented dialog (TOD) systems, a.k.a. "Chatbots".
+As a modular framework, BotSIM can be extended by bot developers to support new bot platforms. As a toolkit, it offers an easy-to-use App and a suite of command line tools for bot admins or practitioners to readily perform evaluation and remediation of their bots at scale. Consequently, BotSIM can accelerate bot development and evaluation, and reduce cost and time-to-market.
 
 Key features of BotSIM include:
 
@@ -57,15 +57,15 @@ Key features of BotSIM include:
 2. Cloning and building dependencies
 ``` bash
    git clone https://github.com/salesforce/botsim.git
-   cd BotSIM
+   cd botsim
    pip install .
 ```
 
 ## Getting Started
 ### Streamlit Web App
-The most straightforward way of getting started with BotSIM is the Streamlit Web App. The app is developed as a multi-page app to guide users to leverage BotSIM's "generation-simulation-remediation" pipeline for evaluation, diagnosis and remediation of their bots.     
+The most straightforward way of getting started with BotSIM is the Streamlit Web App. The multi-page App is developed to guide users to leverage BotSIM's "generation-simulation-remediation" pipeline for evaluation, diagnosis and remediation of their bots.     
 <p align="center" width="100%">
-    <img width="100%" src="docs/BotSIM_App.png">
+    <img width="100%" src="docs/_static/BotSIM_App.png">
 </p>
 
 The following commands can be used to start the Streamlit Web App locally:
@@ -82,22 +82,22 @@ The App can also be deployed as a docker image:
   docker run -p 8501:8501 botsim-streamlit
 ```
 ### Command Line Tools
-Alternatively, users can also use the command line tools to deep-dive into BotSIM's generation-simulation-remediation pipeline.
+Alternatively, users can dive deeper into BotSIM's system components through the command line tools. Details are given in the [tutorial section](https://opensource.salesforce.com/botsim//latest/tutorials.html#botsim-command-line-tools) of the [code documentation](https://opensource.salesforce.com/botsim//latest/tutorials.html).
 
 ## Tutorial
-We provide the following tutorials in the [tutorial section](https:///latest/tutorials.html) of the [code documentation](). 
-- [Streamlit Web App](https://latest/tutorials.html#streamlit-web-app)
-- [BotSIM command line tools](https://latest/tutorials.html#botsim-command-line-tools)
-- [Bot health dashboard navigation](https://atest/dashboard.html)
-- [Applying remedidation suggestions](https://latest/dashboard.html#apply-intent-model-remediation-suggestions)
+We provide the following tutorials in the [code documentation](https://opensource.salesforce.com/botsim//latest/tutorials.html). 
+- [Streamlit Web App](https://opensource.salesforce.com/botsim//latest/tutorials.html#streamlit-web-app)
+- [Command Line Tools](https://opensource.salesforce.com/botsim//latest/tutorials.html#botsim-command-line-tools)
+- [Bot Health Dashboard Navigation](https://opensource.salesforce.com/botsim//latest/dashboard.html)
+- [Applying Remediation Suggestions](https://opensource.salesforce.com/botsim//latest/dashboard.html#apply-intent-model-remediation-suggestions)
 
 ## Documentation 
-For more details of the system components and advanced usages, please refer to [code documentation]((https://opensource.salesforce.com/botsim//latest/index.html#)]).
+For more details of the system components and advanced usages, please refer to the [code documentation](https://opensource.salesforce.com/botsim//latest/index.html#).
 We welcome contributions from the open-source community to improve the toolkit! To support new bot platforms, please also follow the guidelines detailed in the code documentation.
 
 ## System Demo Paper and Technical Report
-You can find more details in our technical report and  system demo paper.
-If you're using BotSIM in your research or applications, please cite using this BibTeX for technical report:
+You can find more details of system designs in our technical report. Detailed system descriptions are given in our EMNLP 2022 system demo paper.
+If you're using BotSIM in your research or applications, please cite using this BibTeX for the technical report:
 ```
 @article{guangsen2022-botsim-tr,
   author    = {Guangsen Wang and Junnan Li and Shafiq Joty and Steven Hoi},
@@ -108,7 +108,7 @@ If you're using BotSIM in your research or applications, please cite using this
   archivePrefix = {arXiv},
 }
 ```
-or the following BibTex for our system demo paper:
+or the following BibTeX for the system demo paper:
 ```
 @article{guangsen2022-botsim-demo,
   author    = {Guangsen Wang and Samson Tan and Shafiq Joty and Guang Wu and Jimmy Au and Steven Hoi},
 
@@ -1,17 +1,17 @@
 Extending BotSIM to new bot platforms
 #######################################
 Bot developers can extend BotSIM to new platforms by implementing their platform-dependent parsers and API clients. 
-They serve as the “adaptors” in order to apply BotSIM’s the “generation-simulation-remediation” pipeline.
+They serve as the “adaptors” in order to apply BotSIM’s “generation-simulation-remediation” pipeline.
 
 Parser
 **************************************************************
 The parser interface is defined in generator.parser and has the following important functions to implement. 
-As these functions are highly platform dependent, the implementation might be non-trivial and require access to bot design documentations from the bot platform provider.  
+As these functions are highly platform dependent, the implementation might be non-trivial and require access to bot design documentation from the bot platform provider.  
 We provide our initial parser implementations for the Einstein BotBuilder (``platform.botbuilder``) and Google DialogFlow CX (``platform.dialogflow_cx``) platforms.  
 The utility functions supporting the parsers are under  ``modules.generator.utils.<platform-name>/parser_utilities.py``
 
-1. ``extract_local_dialog_act_map`` function generates a “local” dialog act map by ignoring incoming and outputting  transitions. In other words, the local map only considers the messages/actions explicitly defined within the dialog. These local dialog act maps are modelled as graph nodes during the subsequent conversation graph modelling. In particular, the messages for the two special dialog acts, namely "intent_success_message"and "dialog_success_message" are also generated here according to the following heuristics:   "intent_success_message" contains the first request message and all its previous normal messages    "dialog_success_message" contains the last messages.
-2. ``conversation_graph_modelling`` models the entire bot design as a graph. Each individual dialog is represented by its local dialog act maps and modelled as the graph nodes. Transitions among the individual dialogs are modelled as the graph edges. The graph modelling is based on the networkx package. There are two outputs from the function: the final dialog act maps and the graph data for conversation path visualisation.
+1. ``extract_local_dialog_act_map`` generates a “local” dialog act map by ignoring incoming and outgoing transitions. In other words, the local map only considers the messages/actions explicitly defined within the dialog. These local dialog act maps are modelled as graph nodes during the subsequent conversation graph modelling. The messages for the two special dialog acts, namely "intent_success_message" and "dialog_success_message", are also generated here according to the following heuristics: "intent_success_message" contains the first request message and all the normal messages preceding it; "dialog_success_message" contains the last messages.
+2. ``conversation_graph_modelling`` models the entire bot design as a graph. Each individual dialog is represented by its local dialog act maps and modelled as the graph nodes. Transitions among the individual dialogs are modelled as the graph edges. The graph modelling is based on the ``networkx`` package. There are two outputs from the function: the final dialog act maps and the graph data for conversation path visualisation.
 3. ``parse`` function defines a general parser pipeline for all platforms starting from parsed local dialog act maps.
 
    .. code-block:: python
 
@@ -4,8 +4,15 @@ Remediator Dashboard Navigation
 
 Bot Health Reports
 **************************************************************
-The bot health dashboard consists of a set of multi-level performance reports. At the highest level, users can have a historical view of most recent simulation/test sessions (e.g., after each major bot update). The historical performance comparison can help users evaluate the impacts of bot changes quantitatively, from which they can make decisions like whether or not keep certain changes.
-In the session-specific performance summary, users can zoom in for more details of a selected test session including the data distribution, overall dialog performance metrics. Furthermore, one can select a dialog/intent of the specific testing session to investigate the detailed intent and NER performance in the dialog-specific performance summary. Through the dialog-specific performance report, one can quickly identify the most confusing intents and entities. This saves significant efforts and helps better allocation of resources for troubleshooting and bot improvement.
+The bot health dashboard consists of a set of multi-level performance reports. At the highest level,
+users can have a historical view of the most recent simulation/test sessions (e.g., after each major bot update).
+The historical performance comparison can help users evaluate the impacts of bot changes quantitatively,
+from which they can make decisions like whether or not to keep certain changes.
+In the session-specific performance summary, users can zoom in for more details of a selected test session,
+including the data distribution and overall dialog performance metrics. Furthermore, one can select a dialog/intent of
+the specific testing session to investigate the detailed intent and NER performance in the dialog-specific performance summary.
+Through the dialog-specific performance report, one can quickly identify the most confusing intents and entities.
+This saves significant effort and helps allocate resources better for troubleshooting and bot improvement.
 
 .. image:: _static/BotSIM_Performance_Report.png
   :width: 550
@@ -16,8 +23,8 @@ In addition to the diagnosis reports, the remediator also provides actionable in
 The remediation dashboards given below allow detailed investigation of all intent or NER errors along with their corresponding simulated chat logs.
 The root causes of the failed conversations are identified via backtracking of the simulation agenda.
 For troubleshooting intent models, the remediator attempts to identify the intent utterances and paraphrases that are wrongly predicted by the current model. Depending on the wrongly classified intent classes, the remediator suggests follow-up actions including 1) augmenting the intent training set with the queries deemed to be out-of-domain by the current intent model, and 2) moving the intent utterance to another intent if most of the paraphrases of the former intent utterance are classified to the latter intent.
-Similarly for NER model, the remediator collects all the wrongly extracted entities and the messages with such entities. Depending on the entity extraction method, users can follow the suggestions to troubleshooting or improving the bot NER capabilities.
-Note the suggestions are meant to be used as guidelines rather than strictly followed. More importantly, they can always be extended by users to include domain expertise in troubleshooting bots related to their products/services.
+Similarly for the NER model, the remediator collects all the wrongly extracted entities and the messages with such entities. Depending on the entity extraction method, users can follow the suggestions to troubleshoot or improve the bot NER capabilities.
+Note that the suggestions are meant to be used as guidelines rather than strictly followed. More importantly, users can always extend them to include domain expertise in troubleshooting bots related to their products/services.
 
 .. image:: _static/Dashboard_Intent_Remediation.png
   :width: 550
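
The intent remediation rule described in this hunk can be sketched roughly as follows. This is only an illustration and not the remediator's actual code: the function name, the ``out_of_domain`` label, the returned action names and the 50% threshold used for "most of the paraphrases" are all assumptions made for the example.

```python
from collections import Counter

# Hypothetical label for queries the intent model rejects as out-of-domain.
OUT_OF_DOMAIN = "out_of_domain"


def suggest_action(true_intent, predicted_intents, majority=0.5):
    """Suggest a follow-up for one intent utterance.

    predicted_intents holds the model's prediction for each paraphrase
    of the utterance. The action names returned are illustrative only.
    """
    wrong = [p for p in predicted_intents if p != true_intent]
    if not wrong:
        return ("keep", None)
    top_intent, count = Counter(wrong).most_common(1)[0]
    # Assumed reading of "most": more than half of all paraphrases.
    if top_intent != OUT_OF_DOMAIN and count / len(predicted_intents) > majority:
        return ("move_utterance", top_intent)
    # Otherwise add the misclassified paraphrases to the training set.
    return ("augment_training_set", None)


print(suggest_action("Report an Issue",
                     ["Check Order Status"] * 3 + ["Report an Issue"]))
```

In line with the note above, such a rule is a guideline: the threshold and the set of actions are the natural places to add domain expertise.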
@@ -30,7 +37,7 @@ Conversational Analytics
 Another useful component of the Remediator is the suite of conversation analytical tools. They further help bot practitioners gain more insights for troubleshooting and 
 improving their dialog systems. The confusion matrix analysis breaks down the intent model performance into (sortable) recall, precision and F1 scores to help identify the 
 worst-performing intents. Another useful analytical tool is the tSNE clustering of the intent utterances using sentence transformer embeddings. The tSNE visualisation enables users 
-to gauge the training data quality. It is also an effective tool in identifying overlapping intents and can potential benefit new intent discovery as well.
+to gauge the training data quality. It is also an effective tool in identifying overlapping intents and can potentially benefit new intent discovery as well.
 Lastly, powered by parsers' conversation graph modelling capability, the dialog path explorer can be used to visualise different conversation flows of the current bot design. 
 For example, users can select the source and target dialogs and investigate the generated dialog paths. Not only is the tool valuable for comprehensive testing coverage of conversation paths, 
 it also offers a controllable approach to troubleshooting dialog design related errors or even improving the current design.
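
To make the dialog path explorer idea concrete, here is a minimal, dependency-free sketch that enumerates the conversation paths between a source and a target dialog. The dialog names and transitions are invented for the example, and the depth-first search is a stand-in rather than BotSIM's implementation; since the parsers build their graphs with ``networkx``, a similar result could presumably be obtained there with ``networkx.all_simple_paths``.

```python
def dialog_paths(transitions, source, target):
    """Enumerate loop-free paths from source to target via depth-first search."""
    paths = []

    def walk(node, path):
        if node == target:
            paths.append(path)
            return
        for nxt in transitions.get(node, []):
            if nxt not in path:  # skip dialogs already visited on this path
                walk(nxt, path + [nxt])

    walk(source, [source])
    return paths


# Hypothetical bot design: each dialog maps to the dialogs it can transition to.
bot_transitions = {
    "Welcome": ["Main Menu"],
    "Main Menu": ["Report an Issue", "Check Order Status"],
    "Check Order Status": ["Report an Issue"],
    "Report an Issue": ["End Chat"],
}

for path in dialog_paths(bot_transitions, "Welcome", "End Chat"):
    print(" -> ".join(path))
```

Listing the paths this way shows which conversation flows a test set would need to cover for a given source and target dialog.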
@@ -44,16 +51,16 @@ Apply intent model remediation suggestions
 The most straightforward approach to applying remediation suggestions is to add the recommended misclassified paraphrases to the original
 training set to retrain the intent model. 
 
-For Einstein BotBuilder platform, new intent sets can be created as a csv file to include the augmented training set. The csv file can be deployed
+For the Einstein BotBuilder platform, new intent sets can be created as a csv file to include the augmented training set. The csv file can be deployed
 to users' org via `Salesforce Workbench <https://workbench.developerforce.com/login.php>`_. The new intent model can be retrained by associating the 
 new intent set name ``report_issue_dev_augmented`` with the ``Report an Issue`` intent.
 
-.. csv-table:: Snippet of augmented intent set csv file for Einstein BotBuilder Platform
+.. csv-table:: Snippet of augmented intent set csv file for the Einstein BotBuilder Platform
    :file: augmented.csv
    :widths: 5,5,90
    :header-rows: 1
 
-For DialogFlow CX, the recommanded paraphrases can be add back to the corresponding training set and the intent model will be automatically retrained.
+For DialogFlow CX, the recommended paraphrases can be added back to the corresponding training set and the intent model will be automatically retrained.
 
 The table below shows the intent F1 score comparison before and after intent model retraining based on the simulation goals created from the same evaluation set.