create_source
Create a new data source in BigML from a remote URL. The source is the first step in the ML pipeline — raw data is imported and parsed for further processing into datasets.
Supports CSV files, JSON, Excel files, and other formats accessible via URL.
create_batch_prediction
Create a batch prediction to generate predictions for an entire dataset at once. More efficient than individual predictions when processing large volumes of data.
The batch prediction runs asynchronously; check its status to know when results are ready.
update_resource
Update a BigML resource's mutable attributes. BigML resources are mostly immutable — only metadata fields like name, description, tags, and category can be updated after creation.
list_resources
List BigML resources of a given type with filtering, ordering, and pagination. Returns a paginated list of resources along with metadata about the total count and navigation.
Use filters to narrow results by name, tags, creation date, and other resource-specific fields.
create_cluster
Create an unsupervised cluster model to group data instances by similarity. Computes centroids representing the center of each cluster.
After creation, use predictions to assign new data points to the nearest cluster centroid.
manage_project
Create or update a BigML project. Projects organize resources into logical groups. Resources can be assigned to a project during creation, and all child resources inherit the project from their parent source.
create_dataset
Create a new dataset in BigML from a source, another dataset, or a list of datasets. Datasets are processed, structured representations of data with statistical summaries for each field.
Supports sampling, filtering, field selection, and train/test splitting.
delete_resource
Permanently delete a BigML resource. This action is irreversible.
Deleting a project will also delete all resources within that project.
execute_whizzml
Execute a WhizzML script on BigML's servers. WhizzML is a domain-specific language for automating ML workflows. Provide either an existing script ID or inline source code to execute.
Executions run asynchronously — check the execution status and retrieve results when finished.
get_resource
Retrieve detailed information about a specific BigML resource by its ID. Returns the full resource object including status, fields, configuration, and results.
Useful for checking the status of asynchronous operations (source creation, model training, evaluations, etc.) and retrieving model metrics.
train_model
Train a machine learning model from a dataset. Supports multiple model types: decision tree, ensemble (random forest, boosted trees), deepnet (neural network), logistic regression, linear regression, and time series.
Choose the model type based on your task — classification, regression, forecasting, etc.
create_evaluation
Evaluate a supervised model's performance by comparing its predictions against a test dataset. Returns metrics like accuracy, precision, recall, and F-measure for classification tasks, or MSE and R-squared for regression tasks.
Works with decision trees, ensembles, deepnets, logistic regressions, linear regressions, and time series models.
create_prediction
Generate a prediction from a trained supervised model. Supports predictions from decision trees, ensembles, deepnets, logistic regressions, linear regressions, and fusions.
Provide input data as field-value pairs. Returns the predicted outcome along with confidence/probability information.
create_anomaly_detector
Create an anomaly detector using isolation forest algorithms. Identifies unusual data points in a dataset by measuring how easily they can be isolated from the rest of the data.
After creation, use anomaly score predictions to evaluate how anomalous new data points are.
create_optiml
Create an OptiML for automated model selection and hyperparameter optimization. BigML will automatically create and evaluate hundreds of models with different algorithms and configurations to find the best performing model for your dataset.
Supports classification and regression tasks.