/ /

/ /

Reshard a Collection

The ideal shard key allows MongoDB to distribute documents evenly throughout the cluster while facilitating common query patterns. A suboptimal shard key can lead to performance or scaling issues due to uneven data distribution. You can change the shard key for a collection to change the distribution of your data across a cluster.

Starting in MongoDB 8.0, you can reshard a collection on the same shard key, allowing you to redistribute data to include new shards or to different zones without changing your shard key. To reshard to the same shard key, set forceRedistribution to true.

Starting in MongoDB 8.0.10, you can reshard a time series collection. All shards in the time series collection must run version 8.0.10 or later to reshard.

Note

Before resharding your collection, read Troubleshoot Shard Keys for information on common performance and scaling issues and advice on how to fix them.

About this Task

Only one collection can be resharded at a time.
writeConcernMajorityJournalDefault must be true.
To reshard a collection that has a uniqueness constraint, the new shard key must satisfy the unique index requirements for any existing unique indexes.
The following commands and corresponding shell methods are not supported on the collection that is being resharded while the resharding operation is in progress:
If you run one of the following operations, the operation waits until resharding completes before executing:
- addShard
- removeShard
- dropDatabase on the database hosting the collection undergoing resharding
If the collection you're resharding uses MongoDB Search, the search index becomes unavailable when the resharding operation completes. You need to manually rebuild the search index once the resharding operation completes.

Before you Begin

Before you begin resharding your collection, ensure that you meet the following requirements:

Your application can tolerate a period of two seconds where the affected collection blocks writes. During the time period where writes are blocked, your application experiences an increase in latency.
If your workload cannot tolerate this requirement, consider refining your shard key instead.
Your database meets these resource requirements:
- Ensure that the available storage space on each recipient shard is at least twice the storage size of the collection that you want to reshard plus its total index size, divided by the number of shards:
  ( ( collection_storage_size + index_size ) * 2 ) / shard_count = storage_req
  For example, consider a collection with a storage size of 2 TB data and a 400 GB index. To distribute it across four shards you'd need:
  ( ( 2 TB collection + 0.4 TB index ) * 2 ) / 4 shards = 1.2 TB storage
  To reshard this collection, each shard requires 1.2 TB of available storage.
  On MongoDB Atlas, you may need to upgrade to the next tier of storage for the reshard operation. You can downgrade once the operation completes.
- Ensure that your I/O capacity is below 50%.
- Ensure that your CPU load is below 80%.
Important
These requirements are not enforced by the database. A failure to allocate enough resources can result in:
- the database running out of space and shutting down
- decreased performance
- the operation taking longer than expected
If your application has time periods with less traffic, perform this operation on the collection during that time if possible.

No index builds are in progress. To check for running index builds, use $currentOp:

db.getSiblingDB("admin").aggregate( [
   { $currentOp : { idleConnections: true } },
   { $match: {
         $or: [
             { "op": "command", "command.createIndexes": { $exists: true } },
             { "op": "none", "msg": /^Index Build/ }
         ]
      }
   }
] )

In the result document, if the inprog field value is an empty array, there are no index builds in progress:

{
   inprog: [],
   ok: 1,
   '$clusterTime': { ... },
   operationTime: <timestamp>
}

You must rewrite your application's queries to use both the current shard key and the new shard key.
Tip
If your application can tolerate downtime, you can perform these steps to avoid rewriting your application's queries to use both the current and new shard keys:
1. Stop your application.
2. Rewrite your application to use the new shard key.
3. Wait until the resharding operation completes. To monitor the resharding process, use the $currentOp pipeline stage.
4. Deploy your rewritten application.
Before the resharding operation completes, the following queries return an error if the query filter does not include either the current shard key or a unique field (like _id):
For optimal performance, we recommend that you also rewrite other queries to include the new shard key.
Once the resharding operation completes, you can remove the old shard key from the queries.

Note

A resharding operation is a write-intensive process which can generate increased rates of oplog. You may wish to:

set a fixed oplog size to prevent unbounded oplog growth.
increase the oplog size to minimize the chance that one or more secondary nodes becomes stale.

See the Replica Set Oplog documentation for more details.

Steps

Important

We strongly recommend that you check the About this Task and read the Steps section in full before resharding your collection.

In a collection resharding operation, a shard can be a:

donor, which currently stores chunks for the sharded collection.
recipient, which stores new chunks for the sharded collection based on the shard keys and zones.

A shard can be donor and a recipient at the same time.

The config server primary is always the resharding coordinator and starts each phase of the resharding operation.

Disable the Balancer

You must turn off the balancer before you begin the process of resharding a collection. To disable the balancer, see here.

Start the resharding operation.

While connected to the mongos, issue a reshardCollection command that specifies the collection to be resharded and the new shard key:

db.adminCommand({
  reshardCollection: "<database>.<collection>",
  key: <shardkey>
})

MongoDB sets the max number of seconds to block writes to two seconds and begins the resharding operation.

To reshard to the same shard key, set forceRedistribution to true:

db.adminCommand({
  reshardCollection: "<database>.<collection>",
  key: <shardkey>,
  forceRedistribution: true
})

You can also use sh.reshardCollection() to reshard a collection with the same key. For an example, see Redistribute Data to New Shards.

Monitor the resharding operation.

To monitor the resharding operation, you can use the $currentOp pipeline stage:

db.getSiblingDB("admin").aggregate([
  { $currentOp: { allUsers: true, localOps: false } },
  {
    $match: {
      type: "op",
      "originatingCommand.reshardCollection": "<database>.<collection>"
    }
  }
])

Note

To see updated values, you need to continuously run the preceeding pipeline.

The $currentOp pipeline outputs:

totalOperationTimeElapsedSecs: elapsed operation time in seconds
remainingOperationTimeEstimatedSecs: estimated time remaining in seconds for the current resharding operation. It is returned as -1 when a new resharding operation starts.
Starting in MongoDB 7.0, remainingOperationTimeEstimatedSecs is also available on the coordinator during a resharding operation.
remainingOperationTimeEstimatedSecs is set to a pessimistic time estimate:
- The catch-up phase time estimate is set to the clone phase time, which is a relatively long time.
- In practice, if there are only a few pending write operations, the actual catch-up phase time is relatively short.

[
  {
    shard: '<shard>',
    type: 'op',
    desc: 'ReshardingRecipientService | ReshardingDonorService | ReshardingCoordinatorService <reshardingUUID>',
    op: 'command',
    ns: '<database>.<collection>',
    originatingCommand: {
      reshardCollection: '<database>.<collection>',
      key: <shardkey>,
      unique: <boolean>,
      collation: { locale: 'simple' }
    },
    totalOperationTimeElapsedSecs: <number>,
    remainingOperationTimeEstimatedSecs: <number>,
    ...
  },
  ...
]

Re-enable the Balancer.

To enable the balancer, see here.

Behavior

Minimum Duration of a Resharding Operation

The minimum duration of a resharding operation is always 5 minutes.

Retryable Writes

Retryable writes initiated before or during resharding can be retried during and after the collection has been resharded for up to 5 minutes. After 5 minutes you may be unable to find the definitive result of the write and subsequent attempts to retry the write fail with an IncompleteTransactionHistory error.

Error Case

Duplicate `_id` Values

The resharding operation fails if _id values are not globally unique to avoid corrupting collection data. Duplicate _id values can also prevent successful chunk migration. If you have documents with duplicate _id values, copy the data from each into a new document, and then delete the duplicate documents.

Back

Refine a Shard Key

Change Shard Key Value

Note

About this Task

Before you Begin

Important

Tip

Note

Steps

Important

Disable the Balancer

Start the resharding operation.

Monitor the resharding operation.

Note

Re-enable the Balancer.

Behavior

Minimum Duration of a Resharding Operation

Retryable Writes

Error Case

Duplicate _id Values

Duplicate `_id` Values