[Nano] Fix the long warm-up time of jit model #7247

ACupofAir · 2023-01-12T08:04:33Z

Description

Add warm up action when optimize jit model by trace or quantize function

1. Why the change?

To solve the problem Too long warm-up time for TorchScript models #7062

2. User API changes

None

3. Summary of the change

Add warm up action before return jit model optimized by trace or quantize
Add warning when load jit model

4. How to test?

test code: change the PRECISION to test bf16_jit or fp32_jit

#%%
import torch
from torchvision.models import resnet18
from bigdl.nano.pytorch import InferenceOptimizer

#%%
USE_JIT = True
# USE_JIT = False
# PRECISION = "fp32"
PRECISION = "bf16"

model_ft = resnet18(pretrained=True)

if PRECISION == "fp32":
    jit_model = InferenceOptimizer.trace(
        model_ft,
        accelerator="jit",
        use_ipex=True,
        input_sample=torch.rand(1, 3, 224, 224),
        thread_num=1,
    )
elif PRECISION == "bf16":
    jit_model = InferenceOptimizer.quantize(
        model_ft,
        precision="bf16",
        accelerator="jit",
        use_ipex=True,
        input_sample=torch.rand(1, 3, 224, 224),
        thread_num=1,
    )
else:
    print("Error: The PRECISION must be 'fp32' or 'bf16'")

x = torch.rand(3, 3, 224, 224)
#%%
with torch.no_grad():
    with torch.jit.optimized_execution(USE_JIT):
        %time y_hat = jit_model(x)

#%%
with torch.no_grad():
    with torch.jit.optimized_execution(USE_JIT):
        %time y_hat = jit_model(x)
#%%
with torch.no_grad():
    with torch.jit.optimized_execution(USE_JIT):
        %time y_hat = jit_model(x)
#%%
with torch.no_grad():
    with torch.jit.optimized_execution(USE_JIT):
        %time y_hat = jit_model(x)
#%%
with torch.no_grad():
    with torch.jit.optimized_execution(USE_JIT):
        %time y_hat = jit_model(x)
#%%
with torch.no_grad():
    with torch.jit.optimized_execution(USE_JIT):
        %time y_hat = jit_model(x)
#%%

5. Result of test

Result without warmup before(bf16)
Result after warmup

ACupofAir added 3 commits January 12, 2023 15:07

add warmup for fp32 and bf16 with jit acce

39c0c53

add warmup warming for load from jit model

311cb7f

Merge branch 'main' into junw/fix/jit_warmup

2ea9937

TheaperDeng added the Nano label Feb 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Nano] Fix the long warm-up time of jit model #7247

[Nano] Fix the long warm-up time of jit model #7247

[Nano] Fix the long warm-up time of jit model #7247

Are you sure you want to change the base?

[Nano] Fix the long warm-up time of jit model #7247

Conversation

Description

1. Why the change?

2. User API changes

3. Summary of the change

4. How to test?

5. Result of test