feat: data parallel inference examples#2805

bowang007 · 2024-05-02T00:29:08Z

Description

This PR shows a simple example about using accelerate library for data parallel inference.

Checklist:

My code follows the style guidelines of this project (You can use the linters)
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas and hacks
I have made corresponding changes to the documentation
I have added tests to verify my fix or my feature
New and existing unit tests pass locally with my changes
I have added the relevant labels to my PR in so that relevant reviewers are notified

github-actions

There are some changes that do not conform to Python style guidelines:

--- /home/runner/work/TensorRT/TensorRT/examples/distributed_inference/data_parallel_gpt2.py 2024-05-02 00:29:27.054073+00:00+++ /home/runner/work/TensorRT/TensorRT/examples/distributed_inference/data_parallel_gpt2.py 2024-05-02 00:31:18.785078+00:00@@ -13,12 +13,26 @@ distributed_state = PartialState() model = GPT2LMHeadModel.from_pretrained("gpt2").eval().to(distributed_state.device) -model.forward = torch.compile(model.forward, backend="torch_tensorrt", options={"truncate_long_and_double": True, "enabled_precisions":{torch.float16}, "debug": True}, dynamic=False,)+model.forward = torch.compile(+ model.forward,+ backend="torch_tensorrt",+ options={+ "truncate_long_and_double": True,+ "enabled_precisions":{torch.float16},+ "debug": True,+ },+ dynamic=False,+) with distributed_state.split_between_processes([input_id1, input_id2]) as prompt: cur_input = torch.clone(prompt[0]).to(distributed_state.device) - gen_tokens = model.generate(cur_input, do_sample=True, temperature=0.9, max_length=100,)+ gen_tokens = model.generate(+ cur_input,+ do_sample=True,+ temperature=0.9,+ max_length=100,+ ) gen_text = tokenizer.batch_decode(gen_tokens)[0]

narendasan

Need a requirements.txt
Annotate the script with description of whats happening https://github.com/pytorch/TensorRT/blob/main/examples/dynamo/torch_compile_advanced_usage.py
Add a reference to index.rst so that it gets rendered in the docs:
TensorRT/docsrc/index.rst
Line 113 in 12e885a
tutorials/_rendered_examples/dynamo/torch_compile_stable_diffusion

narendasan

LGTM

HolyWu · 2024-05-17T12:27:44Z

@bowang007 You didn't properly clean up the merge conflicts, therefore db24b3b had <<<<<<< HEAD, ======= and >>>>>>> dfbf6ea84 (feat: data parallel inference sample) remaining in docsrc/index.rst.

facebook-github-bot added the cla signed label May 2, 2024

github-actionsbot requested changes May 2, 2024
View reviewed changes

bowang007 changed the title ~~feat: data parallel inference sample~~feat: data parallel inference examplesMay 2, 2024

bowang007 requested review from apbose, chohk88, gs-olive, narendasan, peri044 and zewenli98 May 3, 2024 01:05

narendasan reviewed May 3, 2024
View reviewed changes

github-actionsbot added the documentation Improvements or additions to documentation label May 7, 2024

narendasan approved these changes May 14, 2024
View reviewed changes

bowang007 force-pushed the multi_gpu_support branch from 4bc05b7 to dfbf6eaCompare May 16, 2024 23:07

feat: data parallel inference sample
7b4b504

bowang007 force-pushed the multi_gpu_support branch from dfbf6ea to 7b4b504Compare May 16, 2024 23:08

bowang007 merged commit db24b3b into mainMay 17, 2024

bowang007 added a commit that referenced this pull request May 17, 2024
chore: cherry pick of#2805
014ab40

peri044 pushed a commit that referenced this pull request May 21, 2024
chore: cherry pick of#2805(#2851)
93e8f29

laikhtewari pushed a commit that referenced this pull request May 24, 2024
chore: cherry pick of#2805(#2851)
2a8645c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: data parallel inference examples#2805

feat: data parallel inference examples #2805

Uh oh!

bowang007 commented May 2, 2024

Uh oh!

github-actionsbot left a comment

Uh oh!

narendasan left a comment

Uh oh!

narendasan left a comment

Uh oh!

HolyWu commented May 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

feat: data parallel inference examples#2805

feat: data parallel inference examples #2805

Uh oh!

Conversation

bowang007 commented May 2, 2024

Description

Checklist:

Uh oh!

github-actionsbot left a comment

Choose a reason for hiding this comment

Uh oh!

narendasan left a comment

Choose a reason for hiding this comment

Uh oh!

narendasan left a comment

Choose a reason for hiding this comment

Uh oh!

HolyWu commented May 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants