Identify entities from images

The following APIs will enable you to identify entities (faces and/or celebrities) from images.

For identifying entities on iOS we use both AWS backend services as well as Apple's on-device Core ML Vision Framework to provide you with the most accurate results. If your device is offline, we will return results only from Core ML. On the other hand, if you are able to connect to AWS Services, we will return a unioned result from both the service and Core ML. Switching between backend services and Core ML is done automatically without any additional configuration required.

Set up your backend

If you haven't already done so, run amplify init inside your project and then amplify add auth (we recommend selecting the default configuration).

Run amplify add predictions, then use the following answers:

? Please select from one of the categories below (Use arrow keys)
  Interpret
  Learn More
? What would you like to identify?
  Identify Text
❯ Identify Entities
  Identify Labels
? Provide a friendly name for your resource
    <Enter a friendly name here>
? Would you like use the default configuration? (Use arrow keys)
❯ Default Configuration
  Advanced Configuration
? Who should have access?
  Auth users only
❯ Auth and Guest users

Run amplify push to create the resources in the cloud

Working with the API

In order to match entities from a pre-created Amazon Rekognition Collection, ensure that both collectionId and maxEntities are set in your amplifyconfiguration.json file. The value of collectionId should be the name of your collection that you created either with the CLI or the SDK. The value of maxEntities should be a number greater than 0 or less than 51 (50 is the max number of entities Rekognition can detect from a collection). If both collectionId and maxEntities do not have valid values in the amplifyconfiguration.json file, then this call will just detect entities in general with facial features, landmarks, etc. Bounding boxes for entities are returned as ratios so make sure if you would like to place the bounding box of your entity on an image that you multiple the x by the width of the image, the y by the height of the image, and both height and width ratios by the image's respective height and width.

You can identify entity matches from your Rekognition Collection in your app using the following code sample:

func detectEntities(_ image: URL) async throws -> [Predictions.Entity] {
        let result = try await Amplify.Predictions.identify(.entities, in: image)
        print("Identified entities: \(result.entities)")
        return result.entities
    } catch let error as PredictionsError {
        print("Error identifying entities: \(error)")
        throw error
    } catch {
        print("Unexpected error: \(error)")
        throw error

func detectEntities(_ image: URL) -> AnyCancellable {
    Amplify.Publisher.create {
        try await Amplify.Predictions.identify(.entities, in: image)
    .sink(receiveCompletion: { completion in
        if case let .failure(error) = completion {
            print("Error identifying entities: \(error)")
    }, receiveValue: { value in
        print("Identified entities: \(value.entities)")

Detecting Celebrities

To detect celebrities you can pass in .detectCelebrity in the type: field. Results are mapped to IdentifyCelebritiesResult. For example:

func detectCelebrities(_ image: URL) async throws -> [Predictions.Celebrity] {
        let result = try await Amplify.Predictions.identify(.celebrities, in: image)
        let celebrities = result.celebrities
        let celebritiesNames = celebrities.map(\.metadata.name)
        print("Identified celebrities with names: \(celebritiesNames)")
        return celebrities
    } catch let error as PredictionsError {
        print("Error identifying celebrities: \(error)")
        throw error
    } catch {
        print("Unexpected error: \(error)")
        throw error

func detectCelebrities(_ image: URL) -> AnyCancellable {
    Amplify.Publisher.create {
        try await Amplify.Predictions.identify(.celebrities, in: image)
    .sink(receiveCompletion: { completion in
        if case let .failure(error) = completion {
            print("Error identifying celebrities: \(error)")
    }, receiveValue: { value in
        print("Identified celebrities with names: \(value.celebrities.map(\.metadata.name))")

Identify text

Label objects in an image