-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Chronos: add a how to guide to help users optimize their forecaster with forecaster.optimize
#6926
base: main_backup
Are you sure you want to change the base?
Conversation
pip install --pre --upgrade bigdl-chronos[pytorch,inference]
|
You may also need to change some other files to make sure the how to guide could be correctly rendered on the document. After that, you may generate a document page with your own readthedoc id for preview. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
others LGTM
@@ -24,6 +24,7 @@ Forecasting | |||
* `Export the OpenVINO model files to disk <how_to_export_openvino_files.html>`__ | |||
* `Export the TorchScript model files to disk <how_to_export_torchscript_files.html>`__ | |||
* `Preprocess my data <how_to_preprocess_my_data.html>`__ | |||
* `Optimize a forecaster <how_to_optimize_a_forecaster.html>`__ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
could you rebase this file and put this howto under https://bigdl.readthedocs.io/en/latest/doc/Chronos/Howto/index.html#speed-up-a-forecaster this section
others LGTM, Please add the "open in colab" and "bigdl logo", e.g. https://github.com/intel-analytics/BigDL/blob/main/python/chronos/colab-notebook/howto/how_to_speedup_inference_of_forecaster_through_ONNXRuntime.ipynb |
6020b58
to
f2325db
Compare
Description
This method will traverse existing optimization methods(onnxruntime, openvino, jit, …) and save the model with minimum latency under the given data and search restrictions(accelerator, precision, accuracy_criterion) in forecaster.accelerated_model. This post provides a way by which users could replay the method.
1. Why the change?
2. User API changes
Add a new how-to guide.
3. Summary of the change
a new notebook
4. How to test?