AI Model Size to Inference Latency Converter Converter Enter AI model size and select architecture type to estimate inference latency. Model Size: Unit: Megabytes (MB) Gigabytes (GB) Architecture Type: CNN (Convolutional Neural Network) Transformer MLP / Fully Connected RNN / LSTM Estimate Latency Download as PDF