← Back to all posts

Facial Recognition - NSFW/Person of Interest

Domain/Problem Statement:

CivitAI provides an extremely broad and deep range of real person of interest (PoI) imitating models
Using these models to generate NSFW images and posting them to the CivitAI site is a breach of ToS and is a frequent occurence
Detection of these images relies on a blend of facial recognition and human intutition
In the case where a PoI is particularly well known, this is likely to be quickly filtered out using available tools.
However, if a PoI is less well known (outside the anglosphere, for instance), the images can go undetected, or if they are detected by a member of the community may be dismissed by moderators if they are not recognisable, or not present in the data set of recognition tools.
CivitAI aims to host sponsored advertisements in the near future, and as such must be effective in maintaining user-generated content to a high standard of compliance with the ToS.

Proposed changes:

For automation:

automated tools for facial recognition must be used when indicated by pre-existing tagging solutions (NSFW detected), and automatically reject at a high level of confidence, or generate a ticket in less certain cases
broad/broader/broadest datasets must be identified and incorporated into existing tooling where possible
tooling should err towards caution (i.e., ads must not appear next to unauthorized or lewd celebrity impersonations, or otherwise).

For creators:

LORA/Embedding creators must be able to submit matching images (from training data) for their celebrity creations, which can be incoporated into facial recognition matching
User-submitted matching images should be manually verified before submitting to facial recognition tooling
Users should be incentivised to submit accurate images from training data when creating a PoI model
Users could be unable to publish PoI models without providing accurate images from training data
Users should be advised if their PoI isn't present in recognised detection data

For image generators:

Users must be informed if their NSFW is detected to contain a PoI likeness
Users must be given an opportunity to appeal the detection

For Moderators:

Moderators must be able to utilise the tool on demand with preexisting matching information
Moderators must be able to provide 1 or more images of a missing PoI to the matching model for evaluation of images (sample use case: a LORA based on a retired 1990s glamour model is used to generate infringing images, the model is not present in any acknowledge data set but is identified by the community)
Moderators must be able to flag inappropriate or inaccurate detection data supplied by creators
Moderators must able to and train to see and interpret underlying matching data to provide feedback to users and further develop the tool

For PoIs:

PoIs should able to declare after verification directly or via representative that they do not consent to their likeness to be used in CivitAI image data at all, and as such any images detected using this likeness are treated as infringing ToS
PoIs must be protected from infringing likenesses (including those which are incidental or unintentional)

A robust solution, as described above, shall:

be self-improving
be open source for external review by all interested parties
be reviewed by independent experts/concerned bodies
be accurate on a variety of medium
reduce overall administrative and moderation burden
improve moderation accuracy and trust within the community
provide CivitAI with a robust defense in the event of infringement
provide other stakeholders with confidence in CivitAI
provide pubic safety
protect the rights of PoI

A potential subset or extension to this service could also be developed to identify at-risk persons, CSAM and so forth.

Please authenticate to join the conversation.

Upvoters

Status

Awaiting Dev Review

Board

💡 Feature Request

Date

About 2 years ago

Author

MajMorse

Subscribe to post

Get notified by email when there are changes.

Upvoters

Status

Awaiting Dev Review

Board

💡 Feature Request

Date

About 2 years ago

Author

MajMorse

Subscribe to post

Get notified by email when there are changes.