Cluster File System and Storage
This page lists the file storage options available for your MATLAB® Parallel Server™ cluster on Amazon® Web Services (AWS®).
Tip
If you have data stored in an Amazon S3™ bucket, you can use datastores in MATLAB to access the data directly, without needing any storage on the cluster. For details, see Transfer Data to Amazon S3 Buckets and Access Data Using MATLAB.
You can also select the following storage options when creating your cluster.
Cluster Shared Storage
Shared Storage. A persistent shared file system distributed across the cluster. This file system also contains your MATLAB Job Scheduler database. To request shared file storage that persists after you stop your cluster, select a storage size. The shared storage is mounted at /shared/persisted. When you stop the cluster, the content of /shared/persisted is retained as an EBS snapshot in your cloud account. When you delete the cluster, the content of /shared/persisted is permanently removed. For details, see the table below.
Temporary Storage. A temporary shared file system distributed across the cluster. To request temporary shared file storage, the headnode must have ephemeral storage (AWS Instance Store), which is available on "d" instance types in various storage sizes. When the cluster starts, the storage is mounted at /shared/tmp. When you stop the cluster or the cluster shuts down, the content of /shared/tmp is removed. Use it for temporary and intermediate data that you need to access from multiple worker nodes.
MATLAB Drive. To enable access to MATLAB Drive, you must use a personal cluster. The files available on MATLAB Drive are read-only. You can access the files in your MATLAB Drive at the mounted location /MATLAB Drive by following the steps in Access Files in Your MATLAB Drive (MATLAB).
Local Machine Storage
Ephemeral Storage: This type of storage is available only on instance types that contain "d" in their name, for example, m5ad.24xlarge. Each ephemeral storage device (NVMe SSD) is mounted at /mnt/localnvmeN, where N goes from zero to the number of ephemeral storage devices minus one. For example, /mnt/localnvme1 corresponds to the second ephemeral storage device. Additional EBS volumes are mounted at /mnt/localebsN. The availability and size of instance storage vary based on the EC2 instance type chosen. Use this storage for temporary, instance-specific data processing but not for persistent data storage.
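The zero-based mount-point numbering can be sketched in MATLAB as follows. This is a minimal illustration, not part of the cluster setup. The value of numDevices is hypothetical; replace it with the number of ephemeral devices on your instance type. The isfolder check only reports which of the expected paths exist on the machine where the code runs.

```matlab
% Hypothetical number of ephemeral NVMe devices on the instance type.
numDevices = 2;

% Device numbering starts at zero, so N runs from 0 to numDevices-1.
expectedMounts = "/mnt/localnvme" + (0:numDevices-1);

% Report which of the expected mount points exist on this machine.
% Off the cluster, none of them are expected to exist.
isMounted = isfolder(expectedMounts);
for k = 1:numDevices
    fprintf("%s mounted: %d\n", expectedMounts(k), isMounted(k));
end
```

With two devices, the second entry of expectedMounts is /mnt/localnvme1, which matches the example in the text.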
After selecting your storage options, click Create Cluster. For details on other cluster settings, see Create a Cloud Cluster.
All worker machines have access to local and cluster shared storage. You can use these
folders for storing data generated by your jobs, and for data you want to transfer between the
cluster and your client location. See Transfer Data to or from a Cloud Center Cluster. The paths are the same for all worker machines
of the cluster. Changes to files and folders under /mnt/localebsN are not
visible to other machines. Files and folders under the /shared mount point
are shared by all worker machines of your cluster. Changes made by any machine are visible to
all other machines. Each folder has different longevity, as shown in the table.
| Location | Size | Usage |
|---|---|---|
| /MATLAB Drive | Depends on your MATLAB license. For more information, see MATLAB Drive™ Storage Quota (MATLAB). | Enabled when you select a personal cluster at cluster creation. A read-only file system on the worker machines. |
| /shared/persisted | Specified at cluster creation | The location of the cluster shared persisted storage and MATLAB Job Scheduler (MJS) data. This folder is shared among worker machines and is retained between cluster runs. Save data you want to retrieve on the next start of the cluster in folders and files under /shared/persisted. When you stop the cluster, the content of /shared/persisted is retained as an EBS snapshot in your cloud account. Deleted when the cluster is deleted. |
| /shared/tmp | Varies with instance type. Available only if the headnode is an instance with ephemeral storage (Instance Store). | This folder is shared among worker machines and is not retained between cluster runs. Use it to store temporary and intermediate data that must be visible or accessible from multiple worker machines. The available storage space depends on the ephemeral storage available on the selected machine instance type. Deleted when the cluster is stopped. |
| /mnt/localebs0 | 100 GB | This mount point is created only if /shared/persisted is not enabled. It is available only on the headnode, for MATLAB Job Scheduler data and history. The data is retained between cluster restarts. |
| /mnt/localnvmeN | Specified in cluster configuration | Available only, and enabled automatically, on instance types that contain "d" in their name, for example, m5ad.24xlarge. This storage volume is backed by ephemeral storage. Each ephemeral storage device (NVMe SSD) is mounted at /mnt/localnvmeN, where N goes from zero to the number of ephemeral storage devices minus one. |
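As an illustration of using the shared folders for job output, the following MATLAB sketch writes one result file per loop iteration to a shared location and then reads the files back. It is a sketch under stated assumptions, not a prescribed workflow. The subfolder name myJobResults and the squared-number computation are placeholders. The fallback to tempdir is an assumption added so that the code also runs where /shared/persisted does not exist. In that fallback case the code and the parallel workers must run on the same machine.

```matlab
% Use the persisted shared folder when it exists; otherwise fall back to
% tempdir so that this sketch also runs off the cluster (assumption).
sharedRoot = "/shared/persisted";
if ~isfolder(sharedRoot)
    sharedRoot = string(tempdir);
end

% Hypothetical subfolder for the output of this job.
outDir = fullfile(sharedRoot, "myJobResults");
if ~isfolder(outDir)
    mkdir(outDir);
end

% Each iteration writes its own file, so no two workers share a file.
numTasks = 4;
parfor k = 1:numTasks
    result = k^2;  % placeholder computation
    writematrix(result, fullfile(outDir, "result_" + k + ".txt"));
end

% Any machine that sees the shared folder can read the results back.
total = 0;
for k = 1:numTasks
    total = total + readmatrix(fullfile(outDir, "result_" + k + ".txt"));
end
disp(total)
```

Writing one file per iteration avoids concurrent writes to the same file. Files under /shared/persisted survive a cluster stop, whereas the same pattern under /shared/tmp suits only intermediate data.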
Note:
File sharing is enabled only among machines within the same cluster. Machines from different clusters cannot share files.
You create, start, stop, and delete your cloud clusters independent of your local MATLAB session (MATLAB running from your local desktop or MATLAB session in MATLAB Online). Deleting an associated cluster object in MATLAB does not affect the cloud cluster or its storage.
When a cluster times out, it shuts down and clears the contents of the root operating system volume, /shared/tmp, and /mnt/localdataN, but preserves the content of /shared/persisted. If you use an automatic shutdown setting for your cluster, ensure that you have all the data you need from /shared/tmp, /mnt/localdata, and /mnt/localdataN before this timeout occurs.
/shared/tmp is built on ephemeral storage (AWS Instance Store).
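One way to keep intermediate data before a timeout is to copy it from temporary to persisted storage, as in the following MATLAB sketch. The folder names intermediate and intermediate_backup are hypothetical. The code assumes it runs on a cluster machine where both /shared/tmp and /shared/persisted are mounted. Elsewhere it does nothing.

```matlab
% Hypothetical source and destination folders.
srcDir = "/shared/tmp/intermediate";
dstDir = "/shared/persisted/intermediate_backup";

% Copy only when the temporary folder exists, for example on a cluster
% whose headnode has ephemeral storage.
didCopy = false;
if isfolder(srcDir)
    [didCopy, msg] = copyfile(srcDir, dstDir);
    if ~didCopy
        warning("Copy to %s failed: %s", dstDir, msg);
    end
end
```

Checking the status returned by copyfile, rather than letting the call throw an error, lets a longer job report a failed backup and continue.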