vllm.compilation.partition_rules
inductor_partition_rule_context
Context manager to temporarily register Inductor partition rules.
Registers custom partition rules for specified operators, forcing the Inductor scheduler to partition the graph at these operators. The rules are automatically restored to their previous state on exit.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| `splitting_ops` | `list[str]` | List of operator names to partition on. | required |
Source code in vllm/compilation/partition_rules.py
should_split
Check whether a node should be split for Dynamo graph partitioning. The function operates on the Dynamo graph, so `node.target` can be any kind of object. Only targets that are `OpOverload` or `OpOverloadPacket` instances are checked and split on.
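The type filtering described here can be sketched as follows. To stay runnable without PyTorch, the sketch uses hypothetical stand-in classes (`FakeOpOverload`, `FakeOpOverloadPacket`, `FakeNode`) in place of the real PyTorch types. The `call_function` check and the name-matching format are assumptions made for illustration, not details confirmed by this page.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class FakeOpOverloadPacket:
    """Stand-in for an operator packet, e.g. "my_ns::attention"."""
    qualified_name: str


@dataclass(frozen=True)
class FakeOpOverload:
    """Stand-in for one overload of an operator packet."""
    packet_name: str
    overload_name: str


@dataclass
class FakeNode:
    """Stand-in for a graph node with an `op` kind and an arbitrary `target`."""
    op: str
    target: Any


def should_split_sketch(node: FakeNode, splitting_ops: list[str]) -> bool:
    # Assumption: only function-call nodes are candidates for splitting.
    if node.op != "call_function":
        return False
    target = node.target
    if isinstance(target, FakeOpOverloadPacket):
        return target.qualified_name in splitting_ops
    if isinstance(target, FakeOpOverload):
        # Assumption: match either "packet.overload" or the bare packet name.
        full_name = f"{target.packet_name}.{target.overload_name}"
        return full_name in splitting_ops or target.packet_name in splitting_ops
    # Any other target (builtins, strings, plain callables) is never split on.
    return False


ops = ["my_ns::attention"]
packet_node = FakeNode("call_function", FakeOpOverloadPacket("my_ns::attention"))
builtin_node = FakeNode("call_function", len)
print(should_split_sketch(packet_node, ops))   # True
print(should_split_sketch(builtin_node, ops))  # False
```

Returning `False` for every unrecognised target type keeps the check safe on a graph whose targets can be arbitrary Python objects.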