- Step 1: Install and configure the MCP server following the provided instructions.
- Step 2: Provide images or PDFs, either from local storage or by web URL.
- Step 3: Use the OCR tool to extract text from images or PDFs.
- Step 4: Use the caption tool to generate descriptive summaries of images.
- Step 5: Retrieve the processed data via the API for further use.
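
Steps 2 through 4 can be sketched as the requests a client might send to the server. MCP clients invoke tools with a JSON-RPC 2.0 `tools/call` request, so the sketch below only builds those request payloads and does not contact a server. The tool names `ocr` and `caption` and the argument names `source` and `source_type` are assumptions made for illustration; check the server's own tool listing for the real names and schemas.

```python
import json
from urllib.parse import urlparse


def classify_source(source: str) -> str:
    """Return "url" for http(s) links, otherwise "local"."""
    scheme = urlparse(source).scheme.lower()
    return "url" if scheme in ("http", "https") else "local"


def build_tool_call(request_id: int, tool_name: str, source: str) -> dict:
    """Build an MCP-style JSON-RPC 2.0 `tools/call` request.

    The argument names ("source", "source_type") are hypothetical;
    the real server may expect a different input schema.
    """
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": tool_name,
            "arguments": {
                "source": source,
                "source_type": classify_source(source),
            },
        },
    }


if __name__ == "__main__":
    # Hypothetical tool names: "ocr" for text extraction, "caption" for summaries.
    requests = [
        build_tool_call(1, "ocr", "scans/invoice.pdf"),
        build_tool_call(2, "caption", "https://example.com/photo.png"),
    ]
    for request in requests:
        print(json.dumps(request, indent=2))
```

In practice an MCP client library or host application would send these requests and return the results, so hand-building payloads like this is mainly useful for understanding or debugging what goes over the wire.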