Create the tensor and its corresponding IPC handle, then perform an all-gather operation to enable peer access. This should be done on the host side.
Proposal: Introduce a create_distributed_tensor feature that enables initialization of distributed tensors, including the creation and transmission of IPC handles.
Reference: DeepEP