Fig. 1.
Diagram of proposed method. At training time, xu, xl and yl are supplied to the network. xu is an image from the unlabeled target domain and x̂u is the result of applying some augmentation function to xu. A labeled image, xl, is passed through the network, fθ before combining with a label yl to form the segmentation loss, ℒs. The image representations are fed to a domain discriminator d Ω which attempts to maximise the cross-entropy between predicted domain and actual domain, ℒadv. Finally, similarity is promoted between the network predictions on xu and using ℒPC.