Identify entities from images
The following APIs will enable you to identify entities (faces and/or celebrities) from images.
For identifying entities on iOS we use both AWS backend services as well as Apple's on-device Core ML Vision Framework to provide you with the most accurate results. If your device is offline, we will return results only from Core ML. On the other hand, if you are able to connect to AWS Services, we will return a unioned result from both the service and Core ML. Switching between backend services and Core ML is done automatically without any additional configuration required.
Set up your backend
If you haven't already done so, run amplify init
inside your project and then amplify add auth
(we recommend selecting the default configuration).
Run amplify add predictions
, then use the following answers:
? Please select from one of the categories below (Use arrow keys)❯ Identify Convert Interpret Infer Learn More ? What would you like to identify? Identify Text❯ Identify Entities Identify Labels
? Provide a friendly name for your resource <Enter a friendly name here>
? Would you like use the default configuration? (Use arrow keys)❯ Default Configuration Advanced Configuration
? Who should have access? Auth users only❯ Auth and Guest users
Run amplify push
to create the resources in the cloud
Working with the API
In order to match entities from a pre-created Amazon Rekognition Collection, ensure that both collectionId
and maxEntities
are set in your amplifyconfiguration.json
file. The value of collectionId
should be the name of your collection that you created either with the CLI or the SDK. The value of maxEntities
should be a number greater than 0
or less than 51
(50 is the max number of entities Rekognition can detect from a collection). If both collectionId
and maxEntities
do not have valid values in the amplifyconfiguration.json
file, then this call will just detect entities in general with facial features, landmarks, etc. Bounding boxes for entities are returned as ratios so make sure if you would like to place the bounding box of your entity on an image that you multiple the x by the width of the image, the y by the height of the image, and both height and width ratios by the image's respective height and width.
You can identify entity matches from your Rekognition Collection in your app using the following code sample:
func detectEntities(_ image: URL) async throws -> [Predictions.Entity] { do { let result = try await Amplify.Predictions.identify(.entities, in: image) print("Identified entities: \(result.entities)") return result.entities } catch let error as PredictionsError { print("Error identifying entities: \(error)") throw error } catch { print("Unexpected error: \(error)") throw error } }}
func detectEntities(_ image: URL) -> AnyCancellable { Amplify.Publisher.create { try await Amplify.Predictions.identify(.entities, in: image) } .sink(receiveCompletion: { completion in if case let .failure(error) = completion { print("Error identifying entities: \(error)") } }, receiveValue: { value in print("Identified entities: \(value.entities)") })}
Detecting Celebrities
To detect celebrities you can pass in .detectCelebrity
in the type:
field. Results are mapped to IdentifyCelebritiesResult
. For example:
func detectCelebrities(_ image: URL) async throws -> [Predictions.Celebrity] { do { let result = try await Amplify.Predictions.identify(.celebrities, in: image) let celebrities = result.celebrities let celebritiesNames = celebrities.map(\.metadata.name) print("Identified celebrities with names: \(celebritiesNames)") return celebrities } catch let error as PredictionsError { print("Error identifying celebrities: \(error)") throw error } catch { print("Unexpected error: \(error)") throw error }}
func detectCelebrities(_ image: URL) -> AnyCancellable { Amplify.Publisher.create { try await Amplify.Predictions.identify(.celebrities, in: image) } .sink(receiveCompletion: { completion in if case let .failure(error) = completion { print("Error identifying celebrities: \(error)") } }, receiveValue: { value in print("Identified celebrities with names: \(value.celebrities.map(\.metadata.name))") })}