Ejento AI
Guides
QuickstartRecipesREST APIsRelease NotesFAQs
Guides
QuickstartRecipesREST APIsRelease NotesFAQs
Ejento AI
  1. Guides
  • Basic Operations
    • Features
      • Organization → Projects → Assistants → Teams Hierarchy
    • Guides
      • Login/Signup
  • Assistants
    • Overview
    • Features
      • Assistant Access Control
      • Caching Responses for Assistants
      • Assistant Evaluation
      • Evaluation Metrics
      • URL-based Chat Thread Creation and Prepopulation
      • Reasoning Patterns
    • Guides
      • Add Assistant
      • Evaluate Assistant
      • Edit Assistant
      • Assistant Edit Access
      • Embed Assistant
      • Delete Assistant
      • Add Favourite Assistants
      • View Assistant Id
      • View Dataset Id
      • Voice Calling with Assistants
  • Corpus
    • Overview
    • Features
      • Corpus Permissions
      • PII Redaction
    • Guides
      • Assistant Corpus Setup
      • Assistant Corpus Settings
      • Corpus Access Control
      • Corpus Connections
      • ETag Setup for Corpus Incremental Refresh
      • View Corpus Id
      • View Document Id
      • Tagging
        • Corpus tagging
        • Document tagging
  • Teams
    • Overview
    • Guides
      • Add a Team
      • Edit a Team
      • Delete a Team
      • View Team Id
  • Projects
    • Overview
    • Guides
      • Add a Project
      • Edit a Project
      • Managing Assistants in a Project
      • Delete a Project
      • View Project Id
  • User Settings
    • Overview
    • Features
      • Ejento AI User Access Levels
    • Guides
      • Add new user
      • View my User Id
  • API Keys
    • Overview
    • Guides
      • How to generate API Key and Auth Token
  • Workflows
    • Overview
    • Guides
      • Add Workflow
      • Workflow Chat
  • Tools
    • Overview
    • Guides
      • Tools Overview
      • Create External Tool
      • Connect Tool to Assistant
  • Analytics
    • Overview
    • Guides
      • Analyzing Data in the Analytics Dashboard
  • Chatlogs
    • Overview
    • Guides
      • Managing Chatlogs
      • View Chatlog & Chat thread Id
  • Integrations
    • Overview
    • Guides
      • Email Indexing
      • Microsoft Teams
      • Sharepoint Indexing
      • MS Teams Integration Setup
      • Creating a Connection in Credential Manager
      • Slack App
      • Discord Bot
  • Ejento AI Shield
    • Overview
    • Features
      • Understanding Guardrails
    • Guides
      • How to enable Guardrails
  • Assistant Security
    • Overview
    • Features
      • Assistant Red Teaming
    • Guides
      • Red Team an Assistant
Guides
QuickstartRecipesREST APIsRelease NotesFAQs
Guides
QuickstartRecipesREST APIsRelease NotesFAQs
Ejento AI
  1. Guides

Red Team an Assistant

1. Navigate to the assistants section from the sidebar#

Step 1 screenshot

2. Click on the three dots next to your assistant's name.#

Step 2 screenshot

3. Click on Red Team#

Step 3 screenshot

4. The AI Red Teaming Attack page will open.#

Step 4 screenshot

5. Click on Get Started#

Step 5 screenshot

6. Type a Name for the attack that reflects the scenario or risk area you want to test.#

Step 7 screenshot

7. In the red teaming tab, choose the Attack Type you want to run#

Step 8 screenshot

8. Click on Next#

Step 9 screenshot

9. From the list of available Probes, select the probe set that matches your chosen attack type or scenario.#

Each probe, converter, and detector will include a description outlining its use case to help you select the appropriate option for your scenario. For a complete guide on the different types of probes, converters, and detectors, view Assistant Red Teaming
Step 10 screenshot

10. Click on Next#

Step 11 screenshot

11. Select the Converter that defines how prompts and responses will be transformed or formatted before being sent to the model.#

Step 12 screenshot

12. Click on Next#

Step 13 screenshot

13. Choose the Scorer that will evaluate the assistant’s responses.#

Step 14 screenshot

14. Click on Start Attack#

Step 15 screenshot

15. A new job will appear in the jobs table with status Pending. After a short time, the status will update to Running as the attack is executed.#

Step 1 screenshot

16. Click on View Attack Details to review your configuration: Attack Type, Probe, Converter, and Scorer to ensure they match your testing goals.#

Step 2 screenshot

17. A modal will open where you can view the complete details of your attack.#

Step 3 screenshot

18. If your attack is in Running state and you want to stop it, click on Cancel Attack.#

Step 4 screenshot

19. Click on Cancel Attack. Remember that this action can't be undone.#

Step 5 screenshot

20. Once the attack completes, the job status will change to Completed.#

Step 6 screenshot

21. Click on View Result#

Step 7 screenshot

22. Here you can inspect the prompts, model responses, and Scorer outputs for each red team example.#

image.png
Previous
Assistant Red Teaming