torch.load¶
- torch.load(f, map_location=None, pickle_module=pickle, *, weights_only=False, **pickle_load_args)[source]¶
Loads an object saved with
torch.save()
from a file.torch.load()
uses Python’s unpickling facilities but treats storages, which underlie tensors, specially. They are first deserialized on the CPU and are then moved to the device they were saved from. If this fails (e.g. because the run time system doesn’t have certain devices), an exception is raised. However, storages can be dynamically remapped to an alternative set of devices using themap_location
argument.If
map_location
is a callable, it will be called once for each serialized storage with two arguments: storage and location. The storage argument will be the initial deserialization of the storage, residing on the CPU. Each serialized storage has a location tag associated with it which identifies the device it was saved from, and this tag is the second argument passed tomap_location
. The builtin location tags are'cpu'
for CPU tensors and'cuda:device_id'
(e.g.'cuda:2'
) for CUDA tensors.map_location
should return eitherNone
or a storage. Ifmap_location
returns a storage, it will be used as the final deserialized object, already moved to the right device. Otherwise,torch.load()
will fall back to the default behavior, as ifmap_location
wasn’t specified.If
map_location
is atorch.device
object or a string containing a device tag, it indicates the location where all tensors should be loaded.Otherwise, if
map_location
is a dict, it will be used to remap location tags appearing in the file (keys), to ones that specify where to put the storages (values).User extensions can register their own location tags and tagging and deserialization methods using
torch.serialization.register_package()
.- Parameters:
f (Union[str, PathLike, BinaryIO, IO[bytes]]) – a file-like object (has to implement
read()
,readline()
,tell()
, andseek()
), or a string or os.PathLike object containing a file namemap_location (Optional[Union[Callable[[Tensor, str], Tensor], device, str, Dict[str, str]]]) – a function,
torch.device
, string or a dict specifying how to remap storage locationspickle_module (Optional[Any]) – module used for unpickling metadata and objects (has to match the
pickle_module
used to serialize file)weights_only (bool) – Indicates whether unpickler should be restricted to loading only tensors, primitive types and dictionaries
pickle_load_args (Any) – (Python 3 only) optional keyword arguments passed over to
pickle_module.load()
andpickle_module.Unpickler()
, e.g.,errors=...
.
- Return type:
Warning
torch.load()
unless weights_only parameter is set to True, usespickle
module implicitly, which is known to be insecure. It is possible to construct malicious pickle data which will execute arbitrary code during unpickling. Never load data that could have come from an untrusted source in an unsafe mode, or that could have been tampered with. Only load data you trust.Note
When you call
torch.load()
on a file which contains GPU tensors, those tensors will be loaded to GPU by default. You can calltorch.load(.., map_location='cpu')
and thenload_state_dict()
to avoid GPU RAM surge when loading a model checkpoint.Note
By default, we decode byte strings as
utf-8
. This is to avoid a common error caseUnicodeDecodeError: 'ascii' codec can't decode byte 0x...
when loading files saved by Python 2 in Python 3. If this default is incorrect, you may use an extraencoding
keyword argument to specify how these objects should be loaded, e.g.,encoding='latin1'
decodes them to strings usinglatin1
encoding, andencoding='bytes'
keeps them as byte arrays which can be decoded later withbyte_array.decode(...)
.Example
>>> torch.load('tensors.pt') # Load all tensors onto the CPU >>> torch.load('tensors.pt', map_location=torch.device('cpu')) # Load all tensors onto the CPU, using a function >>> torch.load('tensors.pt', map_location=lambda storage, loc: storage) # Load all tensors onto GPU 1 >>> torch.load('tensors.pt', map_location=lambda storage, loc: storage.cuda(1)) # Map tensors from GPU 1 to GPU 0 >>> torch.load('tensors.pt', map_location={'cuda:1': 'cuda:0'}) # Load tensor from io.BytesIO object >>> with open('tensor.pt', 'rb') as f: ... buffer = io.BytesIO(f.read()) >>> torch.load(buffer) # Load a module with 'ascii' encoding for unpickling >>> torch.load('module.pt', encoding='ascii')