Update Mixtral BS #345

yeandy · 2024-07-02T14:24:38Z

Description

Update Mixtral batch size since 256 taking very long time.

Tests

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run one-shot tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed.

vipannalla · 2024-07-02T16:24:57Z

Higher batch_size is supposed to be faster given the same number of samples. How much slower is the current run? and how much faster is the run with batch_size = 128?

yeandy · 2024-07-02T17:15:20Z

It takes about ~12 min for 128 batch on bf16. Haven't tested for int8.

Not sure why it's hitting timeout at 3 hours for int8 version. Let me try running int8 at 128.

RissyRan · 2024-07-18T02:32:34Z

dags/inference/maxtext_inference.py

@@ -457,7 +457,7 @@
          "quant_mode": W_INT8_KV_INT8,
          "quantization": "int8",
          "quantize_kvcache": "true",
-          "per_device_batch_size": 258,


Wait! we have already quantized for inference on MaxText?

Which implementation is this one? I don't think I enabled quantization for both matmul or megablox yet (I mean MoE block, other blocks are enabled). Or other implementation we are talking about here?

vipannalla · 2024-07-19T04:37:10Z

I think this configuration is using the old for for loop implementation, which Ranran cleaned up recently. I don't think the current code support quantization right now.

Update BS

f72c9b7

yeandy requested review from vipannalla, morgandu and mailvijayasingh as code owners July 2, 2024 14:24

RissyRan reviewed Jul 18, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update Mixtral BS #345

Update Mixtral BS #345

yeandy commented Jul 2, 2024

vipannalla commented Jul 2, 2024

yeandy commented Jul 2, 2024

RissyRan Jul 18, 2024

yeandy Jul 18, 2024

RissyRan Jul 19, 2024 •

edited

Loading

vipannalla commented Jul 19, 2024

Update Mixtral BS #345

Are you sure you want to change the base?

Update Mixtral BS #345

Conversation

yeandy commented Jul 2, 2024

Description

Tests

Checklist

vipannalla commented Jul 2, 2024

yeandy commented Jul 2, 2024

RissyRan Jul 18, 2024

Choose a reason for hiding this comment

yeandy Jul 18, 2024

Choose a reason for hiding this comment

RissyRan Jul 19, 2024 • edited Loading

Choose a reason for hiding this comment

vipannalla commented Jul 19, 2024

RissyRan Jul 19, 2024 •

edited

Loading