Running your first test
Objective
In this guide, you will learn how to
- Launch Moonshot UI
- Run tests using benchmark and perform red teaming on one of the OpenAI models.
Launch Moonshot UI
Moonshot UI is designed to simplify the testing workflows. Once Moonshot is installed, you can start the Web UI using the following command:
Then, use your browser and navigate to http://localhost:3000
Note
We will be testing a model from OpenAI in this guide. You will need to prepare an OpenAI token.
Run Benchmark Test
Upon navigating to the webpage, you will be greeted with our main screen. To start a benchmark test, click on "Get Started".
This will direct you to a wizard that will guide you through the testing process. Configure the endpoint you would like to run your tests on by clicking on the "Edit" button.
Enter your API token on this screen, then click "Save".
Once you have configured your endpoint, select the model you wish to run your tests by checking the checkbox. In this example, the tests will be run on OpenAI GPT3.5 Turbo. Then, click on "NEXT".
Now, we will select the test(s) we want to run. In this tutorial, we will run just one test: Facts about Singapore.
Select the test by clicking on the checkbox. Then, click on "NEXT".
Enter the name of the run, and optionally the description.
Select the number of prompts you want to run by using the slider. Then, click on "RUN".
The progress will be shown in the progress bar. When the test completes, you should see that 100% on the progress bar.
Click on "View Report".
You should see your benchmark report. Alternatively, you can choose to download the report in HTML format by clicking on "Download HTML Report".
And there you have it! You have successfully ran your first benchmark test.
Run Red Teaming
To initiate red teaming, click on the icon in the sidebar and select "Start New Session".
Select one or more endpoints to red team on this screen. In this example, we will be using OpenAI GPT4 and OpenAI GPT3.5 Turbo. Then, click on the arrow.
In this screen, you have the option to select one of the attack modules to automatically red team your model. For the purposes of this guide, we will skip this step. Click "Skip for now".
Warning
Important information before running red teaming:
Certain attack modules may require connection to certain model endpoints. (i.e. Violent Durian requires the endpoint openai-gpt4 and you will need an API token to connect to this endpoint.
Refer to this list for the requirements.
Enter a name and type a description in this screen, then click "Start".
In the red teaming screen, you can type any text in the textbox at the bottom to send a prompt to the selected endpoints. The prompt will be sent to all endpoints.
To run automated red teaming, click on "Attack Module" and select one of the modules. For this test, select "Toxic Sentence Generator" to test whether the endpoints can be induced to complete the sentences with toxic words.
Type a cuss word in the prompt. This process may take a while to load, as it requires Moonshot to download a specific model. Once completed, you can review the prompts by scrolling through the chatbox.