We assign each channel of the intermediate features to a region-specific group according to the spatial location of the most strongly activated region of its feature map, using a simple yet effective gradient-based strategy.
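The grouping idea can be illustrated with a minimal sketch. It is not the authors' implementation: the gradient-based scoring is omitted and the raw activation peak of each channel is used as a stand-in, and the names `peak_location`, `group_channels_by_region`, and the region labels are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Grid = Sequence[Sequence[float]]  # one H x W feature map


def peak_location(feature_map: Grid) -> Tuple[int, int]:
    """Return (row, col) of the largest activation in one channel."""
    best_value = None
    best_pos = (0, 0)
    for r, row in enumerate(feature_map):
        for c, value in enumerate(row):
            if best_value is None or value > best_value:
                best_value, best_pos = value, (r, c)
    return best_pos


def group_channels_by_region(
    features: Sequence[Grid],
    region_map: Sequence[Sequence[str]],
) -> Dict[str, List[int]]:
    """Assign each channel index to the region label under its peak.

    Assumption: `region_map` is an H x W grid of semantic labels
    (for example from a face parser) aligned with the feature maps.
    """
    groups: Dict[str, List[int]] = {}
    for channel, fmap in enumerate(features):
        r, c = peak_location(fmap)
        groups.setdefault(region_map[r][c], []).append(channel)
    return groups


# Toy example: three 2 x 2 channels, top row labeled "eyes", bottom row "mouth".
features = [
    [[0.1, 0.9], [0.2, 0.0]],
    [[0.0, 0.1], [0.8, 0.3]],
    [[0.7, 0.2], [0.1, 0.1]],
]
region_map = [["eyes", "eyes"], ["mouth", "mouth"]]
groups = group_channels_by_region(features, region_map)
```

In this toy case channels 0 and 2 peak in the top row and channel 1 peaks in the bottom row, so each semantic region ends up with its own small set of channels.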
The modulation styles of a convolutional layer can then be modified by moving along a sparse direction vector. More specifically, we use a portion of the difference between the latent codes of the positive and negative samples as the editing direction vector.
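A minimal sketch of such a sparse direction follows. The text only says that "a portion" of the difference is used; keeping the entries with the largest magnitude is an assumption made here for illustration, and `sparse_direction`, `move_along`, `keep`, and `strength` are hypothetical names.

```python
from typing import List, Sequence


def sparse_direction(
    positive: Sequence[float],
    negative: Sequence[float],
    keep: int,
) -> List[float]:
    """Difference of two latent codes with all but `keep` entries zeroed.

    Assumption: the retained portion is the `keep` entries with the
    largest absolute difference.
    """
    diff = [p - n for p, n in zip(positive, negative)]
    ranked = sorted(range(len(diff)), key=lambda i: abs(diff[i]), reverse=True)
    kept = set(ranked[:keep])
    return [d if i in kept else 0.0 for i, d in enumerate(diff)]


def move_along(
    style: Sequence[float],
    direction: Sequence[float],
    strength: float,
) -> List[float]:
    """Shift a modulation style along the direction by `strength`."""
    return [s + strength * d for s, d in zip(style, direction)]


positive = [2.0, 0.0, 2.0]
negative = [0.0, 0.5, 1.5]
direction = sparse_direction(positive, negative, keep=1)
edited_style = move_along([1.0, 1.0, 1.0], direction, strength=0.5)
```

Only the first entry of the style changes in this example, which is the point of the sparsity: channels unrelated to the attribute are left alone.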
However, results manipulated with these sparse direction vectors still suffer from insufficient change or from attribute entanglement:
The editing results are strongly correlated with the spatial distribution of the feature maps. To obtain fine-grained control, the modulation styles and the feature maps should be manipulated jointly rather than individually.
The attribute of a specific semantic region is controlled by a few channels of the intermediate features together with their corresponding modulation styles; we refer to these as control units.
Visualization of a typical attribute manipulation pipeline:
Our modification consists of an optimized style $\hat{S}^{l-1}$ and a direction vector $\Delta{S}^l$. A few channels of $F^l$ are replaced by $F^{l}_{U_a}$, computed with $\hat{S}^{l-1}$, while the other channels of $F^l$ are left untouched. The original modulation style $S^l$ and $\Delta{S}^l$ are summed to form the new modulation style.
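The two operations in this description, channel replacement and style summation, can be sketched as follows. This is an illustration under stated assumptions rather than the authors' code: feature channels are treated as opaque objects, the edited channels are assumed to have been computed already with the optimized style, and `replace_channels` and `shift_style` are hypothetical names.

```python
from typing import Iterable, List, Sequence, TypeVar

Channel = TypeVar("Channel")


def replace_channels(
    original: Sequence[Channel],
    edited: Sequence[Channel],
    unit_channels: Iterable[int],
) -> List[Channel]:
    """Take control-unit channels from `edited`, keep the rest of `original`.

    `original` plays the role of F^l, `edited` the features computed with
    the optimized style, and `unit_channels` the channel indices U_a.
    """
    unit = set(unit_channels)
    return [edited[c] if c in unit else original[c] for c in range(len(original))]


def shift_style(style: Sequence[float], delta: Sequence[float]) -> List[float]:
    """New modulation style: S^l + Delta S^l, element-wise."""
    return [s + d for s, d in zip(style, delta)]


original = ["f0", "f1", "f2", "f3"]   # placeholders for per-channel maps
edited = ["g0", "g1", "g2", "g3"]
new_features = replace_channels(original, edited, unit_channels=[1, 3])
new_style = shift_style([1.0, 2.0, 3.0], [0.0, 0.5, 0.0])
```

Only channels 1 and 3 change in the feature tensor, and only the second entry of the style moves, so the edit stays confined to the chosen control unit.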
@inproceedings{10.1145/3474085.3475274,
  author    = {Wang, Rui and Chen, Jian and Yu, Gang and Sun, Li and Yu, Changqian and Gao, Changxin and Sang, Nong},
  title     = {Attribute-Specific Control Units in StyleGAN for Fine-Grained Image Manipulation},
  year      = {2021},
  isbn      = {9781450386517},
  publisher = {Association for Computing Machinery},
  address   = {New York, NY, USA},
  url       = {https://doi.org/10.1145/3474085.3475274},
  doi       = {10.1145/3474085.3475274},
  booktitle = {Proceedings of the 29th ACM International Conference on Multimedia},
  pages     = {926--934},
  numpages  = {9},
  keywords  = {generative adversarial networks(GANs), control unit, image manipulation},
  location  = {Virtual Event, China},
  series    = {MM '21}
}