Network dropout of about 20 percent made all the difference for my SD 1.5 models, and it is a powerful generalization tool. I also think learning rate warmup for the cosine schedule is essential for a finely tuned model, and it would be great to be able to adjust that.
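To make the request concrete, here is a rough sketch of the two knobs in plain Python. It is only an illustration of the general techniques (linear warmup into cosine decay, and inverted dropout at p = 0.2), not how any particular trainer implements them. The function names `lr_at_step` and `apply_dropout`, and all default values such as `base_lr=1e-4` and `warmup_steps=100`, are assumptions made up for this example.

```python
import math
import random


def lr_at_step(step, total_steps, base_lr=1e-4, warmup_steps=100, min_lr=0.0):
    """Linear warmup followed by cosine decay (illustrative defaults, not real settings)."""
    if step < warmup_steps:
        # Ramp linearly from base_lr / warmup_steps up to base_lr.
        return base_lr * (step + 1) / warmup_steps
    # Fraction of the post-warmup phase completed, clamped to [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    progress = min(progress, 1.0)
    # Cosine curve from base_lr down to min_lr.
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


def apply_dropout(values, p=0.2, rng=None):
    """Inverted dropout: zero each value with probability p, rescale survivors by 1 / (1 - p)."""
    rng = rng or random.Random()
    keep = 1.0 - p
    return [v / keep if rng.random() >= p else 0.0 for v in values]


if __name__ == "__main__":
    total = 1000
    for s in (0, 50, 99, 100, 550, 1000):
        print(s, lr_at_step(s, total))
    print(apply_dropout([1.0] * 10, p=0.2, rng=random.Random(0)))
```

An adjustable `warmup_steps` (or a warmup ratio) plus a dropout probability would be the two settings I am asking for. Dropout should only be active during training and switched off at inference.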
Awaiting Dev Review
Feature Request
Over 1 year ago

CruzFlesh