Vectice
Vectice package.
The Vectice package is a library that lets data scientists record their progress in the Vectice app.
This package exposes essential Vectice classes and methods:
- the autolog method
- the connect method
- the Workspace class
- the Project class
- the Phase class
- the Iteration class
- the Dataset class
- the Model class
- the Table class
IMPORTANT INFORMATION
For privacy reasons, Vectice calculates dataframe statistics only if the dataframe contains more than 100 rows. By default, those statistics are calculated on a sample of a million rows and on the first 400 columns. All of these values can be changed on the Organization Settings page by the organization admin.
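The default thresholds can be read as a simple rule. The sketch below only illustrates that rule and is not Vectice's implementation: the function name `statistics_scope` and its parameters are hypothetical, and it assumes that a dataframe smaller than the sample size is used in full.

```python
def statistics_scope(n_rows, n_columns, min_rows=100,
                     max_sample_rows=1_000_000, max_columns=400):
    """Return (rows_sampled, columns_profiled), or None when statistics are skipped.

    Hypothetical helper mirroring the documented defaults; the real values
    are configured on the Organization Settings page.
    """
    # Statistics are only computed for dataframes with MORE than min_rows rows.
    if n_rows <= min_rows:
        return None
    # Assumption: smaller dataframes are used whole, larger ones are capped.
    return min(n_rows, max_sample_rows), min(n_columns, max_columns)


print(statistics_scope(80, 12))           # too few rows: None
print(statistics_scope(5_000, 12))        # (5000, 12)
print(statistics_scope(3_000_000, 650))   # (1000000, 400)
```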
code_capture
module-attribute
code_capture = True
Global code capture flag, enabled by default.
Code capture is triggered when registering a dataset or a model, and only works when a valid Git repository is found. Otherwise, a warning is logged indicating what might be misconfigured in the repository.
Captured information includes the repository name, URL, branch name, commit hash, and whether the repository is dirty (has uncommitted changes).
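To preview what such a capture might contain for your own repository, the same fields can be collected with plain `git` commands. This is a minimal sketch, not how Vectice gathers the data: the helper names `_git` and `describe_repository` are hypothetical, and it assumes the `git` executable is on the PATH and the remote is named `origin`.

```python
import subprocess
from pathlib import Path


def _git(*args, cwd="."):
    """Run a git command and return its stripped stdout, or None on failure."""
    try:
        result = subprocess.run(
            ["git", *args], cwd=cwd, capture_output=True, text=True, check=True
        )
    except (OSError, subprocess.CalledProcessError):
        # git is missing, the path is invalid, or the command failed.
        return None
    return result.stdout.strip()


def describe_repository(path="."):
    """Return repository details as a dict, or None if no Git repository is found."""
    top_level = _git("rev-parse", "--show-toplevel", cwd=path)
    if top_level is None:
        return None
    status = _git("status", "--porcelain", cwd=path)
    return {
        "repository": Path(top_level).name,
        "url": _git("config", "--get", "remote.origin.url", cwd=path),
        "branch": _git("rev-parse", "--abbrev-ref", "HEAD", cwd=path),
        "commit": _git("rev-parse", "HEAD", cwd=path),
        # Any line in the porcelain status means uncommitted changes.
        "is_dirty": bool(status),
    }


print(describe_repository())
```

A result of `None` here corresponds to the case where no valid repository is found.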
Examples:
To disable code capture globally:
>>> import vectice
>>> vectice.code_capture = False
To re-enable code capture globally:
>>> import vectice
>>> vectice.code_capture = True
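Because the flag is a plain module attribute, it can also be switched off for a single block of code and restored afterwards. The context manager below is a hypothetical helper, not part of the Vectice package, and it assumes the flag is read at the time an asset is registered. A `SimpleNamespace` stands in for the `vectice` module so the sketch runs without the package installed; in practice you would pass the imported `vectice` module instead.

```python
from contextlib import contextmanager
from types import SimpleNamespace


@contextmanager
def flag_set_to(module, name, value):
    """Temporarily set module.<name> to value, restoring the previous value on exit."""
    previous = getattr(module, name)
    setattr(module, name, value)
    try:
        yield
    finally:
        # Restore the original setting even if the block raised.
        setattr(module, name, previous)


# Stand-in for the real package (assumption: same attribute name as documented).
fake_vectice = SimpleNamespace(code_capture=True)

with flag_set_to(fake_vectice, "code_capture", False):
    print(fake_vectice.code_capture)  # False inside the block
print(fake_vectice.code_capture)      # True again afterwards
```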
code_file_capture
module-attribute
code_file_capture = False
Global code file capture flag, disabled by default.
Controls whether the code file is captured as part of the lineage when logging an asset into Vectice. When enabled (set to True), logging an asset also adds the executed file to its lineage. Enabling this feature may increase API runtime because of the file transfer. Databricks is currently not supported.
Examples:
To enable code file capture globally:
>>> import vectice
>>> vectice.code_file_capture = True
To disable code file capture globally:
>>> import vectice
>>> vectice.code_file_capture = False
auto_extract
module-attribute
auto_extract = True
Global auto extraction flag, enabled by default.
Extraction is automatically performed when registering a dataset or a model with an attachment. Currently, it exclusively operates on Excel files and extracts sheets (as CSV files) and images to be referenced inside Vectice.
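To check which images an Excel attachment carries before registering it, note that an `.xlsx` file is a ZIP archive whose embedded images are stored under `xl/media/`. The sketch below lists them with the standard library only. It is an illustration, not Vectice's extraction code; `list_embedded_images` is a hypothetical helper, and the tiny archive built here is a stand-in for a real workbook.

```python
import io
import zipfile


def list_embedded_images(workbook):
    """Return the archive paths of images stored in an .xlsx workbook.

    `workbook` may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(workbook) as archive:
        return sorted(
            name
            for name in archive.namelist()
            if name.startswith("xl/media/") and not name.endswith("/")
        )


# Build a minimal stand-in archive so the example runs without a real workbook.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr("xl/workbook.xml", "<workbook/>")
    archive.writestr("xl/media/image1.png", b"placeholder bytes")
buffer.seek(0)

print(list_embedded_images(buffer))
```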
Examples:
To disable auto extraction of files globally:
>>> import vectice
>>> vectice.auto_extract = False
To re-enable auto extraction globally:
>>> import vectice
>>> vectice.auto_extract = True
pickle_capture
module-attribute
pickle_capture = True
Global pickle capture flag, enabled by default.
Pickle capture is triggered when registering a Vectice Model and only works when a valid predictor parameter is passed or when Autolog detects an estimator, for example a scikit-learn regressor. The predictor is pickled and attached to the Model Version in Vectice.
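Pickling serializes a fitted object to bytes so that it can be stored and reloaded later. The sketch below shows that round trip with the standard `pickle` module only. It does not reproduce how Vectice builds or uploads the attachment, and `statistics.NormalDist` is merely a stand-in for a fitted estimator so the example runs without scikit-learn.

```python
import pickle
from statistics import NormalDist

# Stand-in for a fitted predictor: a distribution estimated from sample data.
fitted = NormalDist.from_samples([1.0, 2.0, 3.0])

# Serialize to bytes; an artifact like this is what gets attached to a model version.
payload = pickle.dumps(fitted)

# Deserialize and confirm the restored object matches the original.
restored = pickle.loads(payload)
print(type(payload).__name__, len(payload) > 0)
print(restored == fitted)
```

Before relying on pickle capture, it can be worth checking that your own predictor survives `pickle.dumps` in the same way, since not every object is picklable.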
Examples:
To disable pickle capture globally:
>>> import vectice
>>> vectice.pickle_capture = False
To re-enable pickle capture globally:
>>> import vectice
>>> vectice.pickle_capture = True
Standard API
>>> from sklearn.linear_model import LinearRegression
>>> from vectice import Model
...
>>> my_estimator = LinearRegression()
>>> model = Model(predictor=my_estimator)
Autolog
>>> from sklearn.linear_model import LinearRegression
>>> from vectice import autolog
...
>>> my_estimator = LinearRegression()
>>> autolog.cell()