The X-AnyLabeling
Chatbot is an integrated AI assistant that allows users to interact directly with Large Language Models (LLMs) within their labeling workflow.
This feature enables you to engage in chat conversations using natural language, batch process image-text question-answering data, and supports one-click import/export of multimodal image data in the ShareGPT format (based on single-turn or multi-turn conversations) for direct use in fine-tuning frameworks like LLaMA-Factory.
open_chatbot_and_chat.mp4
To open the Chatbot, click the Chatbot icon in the left toolbar of X-AnyLabeling or use the following keyboard shortcut to quickly launch the Chatbot interface:
- Windows/Linux:
Ctrl
+b
- macOS:
⌘
+b
On the first launch, you need to configure the necessary API credentials and models in the right panel of the Chatbot
window.
The Chatbot features a three-panel interface designed for streamlined interaction.
select_model.mp4
Provider | API Key | API Docs | Model Docs |
---|---|---|---|
Anthropic | Link | Link | Link |
DeepSeek | Link | Link | Link |
Google AI | Link | Link | Link |
Ollama | - | Link | Link |
OpenAI | Link | Link | Link |
OpenRouter | Link | Link | Link |
Qwen | Link | Link | Link |
- Chat Window: View your conversation history with the AI. Supports copy, edit, delete, and rerun functions.
- Message Input: Enter your questions or instructions. Includes a one-click option to clear the current conversation history.
- Special Command: After importing an image, use the
@image
prompt to include the current image in your query.
- Image Preview: Displays the current image.
- Function Components:
- Image Navigation: Switch between the previous and next images.
- Image Import: Import a single image file or an entire directory.
- Data Export: Export annotation results.
- Batch Processing: Run processing tasks on multiple images.
- Backend Settings: Configure API endpoints, keys, and select models.
- Generation Parameters: Input system prompts, temperature settings, and maximum output length.
import_image_folder.mp4
Ask questions about the current image to efficiently generate single-turn or multi-turn image-text dialogues:
@image Please describe this image.
batch_run.mp4
Process multiple images with the same prompt to speed up your workflow:
- Load an image folder.
- Click the "Run All Images" button.
- Enter the prompt to apply to all images.
export_sharegpt.mp4
Supports one-click export of multimodal image data in the ShareGPT format.
The Chatbot stores its configuration in the following location within the user's home directory:
~/.xanylabeling_data/chatbot/
This includes:
models.json
: Contains user preferences and model configurations.providers.json
: API provider settings.
Ctrl
/⌘
+Enter
: Send messageEnter
: Add a new line in the message input
- The Chatbot feature is currently in beta and may be updated in future versions.
- Local models via Ollama can be used without an internet connection and theoretically support any model that adheres to the OpenAI-compatible API standard.