Amplify has re-imagined the way frontend developers build fullstack applications. Develop and deploy without the hassle.

Page updated Apr 29, 2024

Syncing data to cloud

Once you're happy with your application, you can start syncing with the cloud by provisioning a backend from your project. DataStore can connect to remote backend and automatically sync all locally saved data using GraphQL as a data protocol.

Best practice: it is recommended to develop without cloud synchronization enabled initially so you can change the schema as your application takes shape without the impact of having to update the provisioned backend. Once you are satisfied with the stability of your data schema, setup cloud synchronization as described below and the data saved locally will be synchronized to the cloud automatically.

Setup cloud sync

Synchronization between offline and online data can be tricky. DataStore's goal is to remove that burden from the application code and handle all data consistency and reconciliation between local and remote behind the scenes, while developers focus on their application logic. Up to this point the focus was to setup a local data store that works offline and has all the capabilities you would expect from a data persistence framework.

The next step is to make sure the locally saved data is synchronized with a cloud backend powered by AWS AppSync.

Note: Syncing data between the cloud and the local device starts automatically whenever you run any DataStore operation after your app is set up.

Add the API plugin

Although DataStore presents a distinct API, its cloud synchronization functionality relies on the underlying API category. Therefore, you will still be required to incorporate the API plugin when working with DataStore.

Add AWSAPIPlugin in your Amplify initialization code alongside with the previously added AWSDataStorePlugin.

try Amplify.add(plugin: AWSAPIPlugin())

Push the backend to the cloud

By now you should have a backend created with conflict detection enabled, as described in the Getting started guide.

Check the status of the backend to verify if it is already provisioned in the cloud.

amplify status

You should see a table similar to this one.

| Category | Resource name | Operation | Provider plugin |
| -------- | ----------------- | --------- | ----------------- |
| Api | amplifyDatasource | No Change | awscloudformation |

Troubleshooting: if amplify status gives you an error saying "You are not working inside a valid Amplify project", make sure you run amplify init before the next step.

In case Operation says Create or Update you need to push the backend to the cloud.

amplify push

AWS credentials needed. At this point an AWS account is required. If you have never run amplify configure before, do it so and follow the steps to configure Amplify with your AWS account. Details can be found in the Configure the Amplify CLI guide.

Existing backend

DataStore can connect to an existing AWS AppSync backend that has been deployed from another project, no matter the platform it was originally created in. In these workflows it is best to work with the CLI directly by running an amplify pull command from your terminal and then generating models afterwards, using the process described in the Getting started guide.

For more information on this workflow please see the Multiple Frontends documentation.

Distributed data

When working with distributed data, it is important to be mindful about the state of the local and the remote systems. DataStore tries to make that as simple as possible for you; however, some scenarios might require some consideration.

For instance, when updating or deleting data, one has to consider that the state of the local data might be out-of-sync with the backend. This scenario can affect how conditions should be implemented.

Update and delete with predicate

For such scenarios both the save() and the delete() APIs support an optional predicate which will be sent to the backend and executed against the remote state.

do {
try await Amplify.DataStore.save(
post,
where: Post.keys.title.beginsWith("[Amplify]"))
print("Post updated successfully!")
} catch let error as DataStoreError {
print("Could not update post, maybe the title has been changed? - \(error)")
} catch {
print("Unexpected error \(error)")
}
let sink = Amplify.Publisher.create {
try await Amplify.DataStore.save(
post,
where: Post.keys.title.beginsWith("[Amplify]"))
}.sink {
if case let .failure(error) = $0 {
print("Could not update post, maybe the title has been changed? - \(error)")
}
}
receiveValue: { _ in
print("Post updated successfully!")
}

There's a difference between the traditional local condition check using if/else constructs and the predicate in the save() and delete() APIs as you can see in the example below.

// Tests only against the local state
if post.title.starts(with: "[Amplify]") {
let savedPost = try await Amplify.DataStore.save(post)
}
// Only applies the update if the data in the remote backend satisfies the criteria
let savedPost = try await Amplify.DataStore.save(
post,
where: Post.keys.title.beginsWith("[Amplify]")
)

Conflict detection and resolution

When concurrently updating the data in multiple places, it is likely that some conflict might happen. For most of the cases the default Auto-merge algorithm should be able to resolve conflicts. However, there are scenarios where the algorithm won't be able to be resolved, and in these cases, a more advanced option is available and will be described in detail in the conflict resolution section.

Clear local data

Amplify.DataStore.clear() provides a way for you to clear all local data if needed. This is a destructive operation but the remote data will remain intact. When the next sync happens, data will be pulled into the local storage again and reconstruct the local data.

One common use for clear() is to manage different users sharing the same device or even as a development-time utility.

Note: In case multiple users share the same device and your schema defines user-specific data, make sure you call Amplify.DataStore.clear() when switching users. Visit Auth events for all authentication related events.

let isSignedOut = HubFilters.forEventName(HubPayload.EventName.Auth.signedOut)
let token = Amplify.Hub.listen(to: .auth, isIncluded: isSignedOut) { payload in
Task {
do {
try await Amplify.DataStore.clear()
print("Local data cleared successfully.")
} catch let error as DataStoreError {
print("Error clearing DataStore \(error)")
} catch {
print("Unexpected error \(error)")
}
}
}
let isSignedOut = HubFilters.forEventName(HubPayload.EventName.Auth.signedOut)
let sink = Amplify.Hub.publisher(for: .auth)
.setFailureType(to: DataStoreError.self)
.filter { isSignedOut($0) }
.sink { _ in }
receiveValue: { _ in
Task {
do {
try await Amplify.DataStore.clear()
print("Local data cleared successfully.")
} catch {
print("Local data not cleared \(error)")
}
}
}

This is a simple yet effective example. However, in a real scenario you might want to only call clear() when a different user is signedIn in order to avoid clearing the database for a repeated sign-in of the same user.

Selectively syncing a subset of your data

By default, DataStore fetches all the records that you’re authorized to access from your cloud data source to your local device. The maximum number of records that will be stored locally is configurable here.

You can utilize selective sync to persist a subset of your data instead.

Selective sync works by applying predicates to the base and delta sync queries, as well as to incoming subscriptions.

Note that selective sync is applied on top of authorization rules you’ve defined on your schema with the @auth directive. For more information see the Setup authorization rules section.

let syncExpr1 = DataStoreSyncExpression.syncExpression(Post.schema) {
Post.keys.rating.gt(5)
}
let syncExpr2 = DataStoreSyncExpression.syncExpression(Comment.schema) {
Comment.keys.content.beginsWith("the")
}
try Amplify.add(plugin: AWSDataStorePlugin(
modelRegistration: AmplifyModels(),
configuration: .custom(syncExpressions: [syncExpr1, syncExpr2])
))

When DataStore starts syncing, only Posts with rating > 5 and Comments with status equal to active will be synced down to the user's local store.

Developers should only specify a single syncExpression per model. Any subsequent expressions for the same model will be ignored.

Reevaluate expressions at runtime

Sync expressions get evaluated whenever DataStore starts. In order to have your expressions reevaluated, you can execute Amplify.DataStore.clear() or Amplify.DataStore.stop() followed by Amplify.DataStore.start().

If you have the following expression and you want to change the filter that gets applied at runtime, you can do the following:

public var rating = 5
func initialize() {
do {
let variableSyncExpr = DataStoreSyncExpression.syncExpression(Post.schema) {
Post.keys.rating.gt(self.rating)
}
try Amplify.add(plugin: AWSDataStorePlugin(
modelRegistration: AmplifyModels(),
configuration: .custom(syncExpressions: [variableSyncExpr])
))
} catch {
print("Failed to initialize Amplify with \(error)")
}
}
func changeSync() {
rating = 1
do {
try await Amplify.DataStore.stop()
print("DataStore stopped")
try await Amplify.DataStore.start()
print("DataStore started")
} catch let error as DataStoreError {
print("Failed with error \(error)")
} catch {
print("Unexpected error \(error)")
}
}

Each time DataStore starts (via start or any other operation: query, save, delete, or observe), DataStore will reevaluate the syncExpressions.

In the above case, the predicate will contain the value 1, so all Posts with rating > 1 will get synced down.

Keep in mind: Amplify.DataStore.stop() will retain the local store's existing content. Run Amplify.DataStore.clear() to clear the locally-stored contents.

When applying a more restrictive filter, clear the local records first by running DataStore.clear() instead:

func changeSync() {
rating = 8
do {
try await Amplify.DataStore.stop()
print("DataStore stopped")
try await Amplify.DataStore.start()
print("DataStore started")
} catch let error as DataStoreError {
print("Failed with error \(error)")
} catch {
print("Unexpected error \(error)")
}
}

This will clear the contents of your local store, reevaluate your sync expressions and re-sync the data from the cloud, applying all of the specified predicates to the sync queries.

You can also have your sync expression return QueryPredicateConstant.all in order to remove any filtering for that model. This will have the same effect as the default sync behavior.

public var rating: Int? = 5
func initialize() {
let syncExpr = DataStoreSyncExpression.syncExpression(Post.schema) {
guard let rating = self.rating else {
return QueryPredicateConstant.all
}
return Post.keys.rating.gt(rating)
}
do {
try Amplify.add(plugin: AWSDataStorePlugin(
modelRegistration: AmplifyModels(),
configuration: .custom(syncExpressions: [syncExpr])
))
} catch {
print("Failed to initialize Amplify with \(error)")
}
}

DataStore.configure() should only by called once.

Advanced use case - Query instead of Scan

You can configure selective sync to retrieve items from DynamoDB with a query operation against a GSI. By default, the base sync will perform a scan. Query operations enable a highly efficient and cost-effective data retrieval for customers running DynamoDB at scale. Learn about creating GSIs with the @index directive here.

In order to do that, your syncExpression should return a predicate that maps to a query expression.

For example, for the following schema:

type User @model {
id: ID!
firstName: String!
lastName: String! @index(name: "byLastName", sortKeyFields: ["createdAt"])
createdAt: AWSDateTime!
}

To construct a query expression, return a predicate with the primary key of the GSI. You can only use the eq operator with this predicate.

For the schema defined above User.keys.lastName.eq("Doe") is a valid query expression.

Optionally, you can also chain the sort key to this expression, using any of the following operators: eq | ne | le | lt | ge | gt | beginsWith | between.

E.g., User.keys.lastName.eq("Doe").and(User.keys.createdAt.gt("2020-10-10").

Both of these sync expressions will result in AWS AppSync retrieving records from Amazon DynamoDB via a query operation:

let syncExpr = DataStoreSyncExpression.syncExpression(User.schema) {
User.keys.lastName.eq("Doe")
}
try Amplify.add(plugin: AWSDataStorePlugin(
modelRegistration: AmplifyModels(),
configuration: .custom(syncExpressions: [syncExpr])
))
// OR
let syncExpr = DataStoreSyncExpression.syncExpression(User.schema) {
User.keys.lastName.eq("Doe").and(User.keys.createdAt.gt("2020-10-10"))
}
try Amplify.add(plugin: AWSDataStorePlugin(
modelRegistration: AmplifyModels(),
configuration: .custom(syncExpressions: [syncExpr])
))