Recently, videos of modern people traveling back to ancient battlefields to do live - streaming have become very popular. Since they are quite interesting, I trained a direct - output model. It doesn't require complex prompt words. You just need to say "Self - portrait of ***", where "***" can be various animals or people. Applicable to: First - person - perspective short videos / Digital human live - streaming / Documentary photography