[Feature] Support batched generation (inference) in evo2.

### Problem & Motivation

* Customers have observed significant speedups when they need to generate based on multiple prompts using batched generations.
* Currently fir/irr state are maintained without batch index, so to get batching we would need to introduce batch index in inference_context.fir_state etc in the inference kernels.

### BioNeMo Framework Version

cd74c2bdce221fa44879fd648f94febc4de6c145

### Category

Inference

### Proposed Solution

* Add batch index to fir/irr state are maintained without batch index, so to get batching we would need to introduce batch index in inference_context.fir_state etc in NeMO.
* Add test coverage for batched inference.

### Expected Benefits

* Significant (10x+) performance gains for many shorter generations.

### Code Example

```python

```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Support batched generation (inference) in evo2. #1152

Problem & Motivation

BioNeMo Framework Version

Category

Proposed Solution

Expected Benefits

Code Example

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

[Feature] Support batched generation (inference) in evo2. #1152

Description

Problem & Motivation

BioNeMo Framework Version

Category

Proposed Solution

Expected Benefits

Code Example

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions