jjzjj

ChildFailedError

全部标签

torch.distributed.elastic.multiprocessing.errors.ChildFailedError

问题Traceback(mostrecentcalllast):File"/ssd1/miniconda3/envs/pytorch2.1.2/bin/torchrun",line33,insys.exit(load_entry_point('torch==2.1.2','console_scripts','torchrun')())File"/ssd1/miniconda3/envs/pytorch2.1.2/lib/python3.8/site-packages/torch/distributed/elastic/multiprocessing/errors/__init__.py",li