create_model
Create Model
Create a new AI model in Nanonets. Supports OCR document extraction, image classification, and object detection model types. Define the categories or fields you want the model to recognize.
create_model
Create a new AI model in Nanonets. Supports OCR document extraction, image classification, and object detection model types. Define the categories or fields you want the model to recognize.
list_processed_files
List all files that have been processed by a Nanonets model within a date range. Returns both moderated (reviewed) and unmoderated files with their extraction results and review status.
extract_document_data
Upload a document to a Nanonets OCR model and extract structured data. Supports processing documents via URL. Returns extracted fields, tables, bounding boxes, and confidence scores. Use synchronous mode for small documents (≤3 pages) and async mode for larger files.
upload_training_data
Upload training images via URL to a Nanonets model. Supports OCR, image classification, and object detection models. For classification models, images must be associated with a category. For OCR and object detection, annotations with bounding boxes can be provided.
get_prediction_results
Retrieve extraction/prediction results for a previously processed document. Use this to check the status of async predictions or to retrieve detailed results by file ID or page ID.
review_file
Approve or unapprove a processed file in a Nanonets model. Approval marks the file's extracted data as verified. Unapproval reverts an approved file back to the unreviewed state.
retry_file_processing
Retry predictions or exports for previously processed files. Use this when files failed to process correctly or when exports need to be re-triggered.
get_model
Retrieve details about a Nanonets model including its current state, configured categories, and training status. Works for OCR, image classification, and object detection models.
extract_full_text
Extract the complete raw text from documents or images using Nanonets OCR. Unlike structured extraction, this returns all text found in the document without field mapping. Useful for getting the raw content of PDFs, scanned documents, and images.
detect_objects
Detect and locate objects within images using a trained Nanonets object detection model. Returns bounding box coordinates and confidence scores for each detected object.
classify_image
Classify images into predefined categories using a trained Nanonets image classification model. Provide one or more image URLs and receive category labels with probability scores.
train_model
Initiate training for a Nanonets model. Supports OCR, image classification, and object detection models. Training data must be uploaded before training can start.
Extract structured data from unstructured documents using AI-powered OCR and deep learning. Create and manage document processing workflows to extract fields and tables from invoices, receipts, forms, purchase orders, bank statements, and other document types. Upload documents as files or URLs for synchronous or asynchronous processing, receiving extracted data with confidence scores and bounding boxes. Train custom image classification and object detection models with labeled data. Manage human-in-the-loop review processes including approving, rejecting, and moderating extracted predictions. Apply post-processing transformations such as date formatting, currency detection, regex matching, lookup enrichment, and custom Python scripts. Import documents from email, Dropbox, Google Drive, OneDrive, and SharePoint. Export processed data to Google Sheets, QuickBooks, Xero, Salesforce, databases (PostgreSQL, MySQL, MSSQL), and other platforms. Receive webhook notifications on document processing, approval, rejection, and assignment events.
Common questions about connecting Nanonets to AI agents with Metorial.