Matches in SemOpenAlex for { <https://semopenalex.org/work/W4313157089> ?p ?o ?g. }
Showing items 1 to 60 of
60
with 100 items per page.
- W4313157089 endingPage "14" @default.
- W4313157089 startingPage "1" @default.
- W4313157089 abstract "Manipulating visual attributes of an image through a natural language description, known as text-to-image attributes manipulation (T2AM), is a challenging task. However, existing approaches tend to search the whole image to manipulate the target instance indicated by a description, thus they often fail to locate and manipulate the accurate text-relevant regions, and even disturb the text-irrelevant contents, e.g. texture and background. Meanwhile, the model efficiency needs to be improved. To tackle the above issues, we introduce a novel yet simple GAN-based approach, namely <italic xmlns:mml=http://www.w3.org/1998/Math/MathML xmlns:xlink=http://www.w3.org/1999/xlink><u>S</u>tructuring <u>I</u>mage for <u>M</u>anipulating</i> (SIMGAN), to narrow down the optimization areas from external to internal. It consists of two major components: 1) <italic xmlns:mml=http://www.w3.org/1998/Math/MathML xmlns:xlink=http://www.w3.org/1999/xlink>External Structuring</i> (ExST), a pretrained segmentation network, for recognizing and separating the target instances and background from an image; and 2) <italic xmlns:mml=http://www.w3.org/1998/Math/MathML xmlns:xlink=http://www.w3.org/1999/xlink>Internal Structuring</i> (InST) for seeking out and editing the text-relevant attributes of the target instances based on the given description and masked hierarchical image representations from ExST. Specifically, the InST structures target instances from outline to detail by firstly drawing the sketch and colors underpainting of instances with an <italic xmlns:mml=http://www.w3.org/1998/Math/MathML xmlns:xlink=http://www.w3.org/1999/xlink>Outline-Oriented Structuring</i> (OuST), and then enhancing the text-relevant attributes and elaborating on details with a <italic xmlns:mml=http://www.w3.org/1998/Math/MathML xmlns:xlink=http://www.w3.org/1999/xlink>Detail-Oriented Structuring</i> (DeST). Extensive experiments on benchmark datasets demonstrate that our framework significantly outperforms state-of-the-art both quantitatively and qualitatively. Compared with the state-of-the-art method ManiGAN, our approach reduces the training time by 88%, while the inferring time is three times faster. In addition, our approach is easily extended to solve the instance-level image-to-image translation problem, and the results exhibit the versatility and effectiveness of our approach. We release our code in <uri xmlns:mml=http://www.w3.org/1998/Math/MathML xmlns:xlink=http://www.w3.org/1999/xlink>https://github.com/qikizh/SIMGAN</uri> ." @default.
- W4313157089 created "2023-01-06" @default.
- W4313157089 creator A5014087220 @default.
- W4313157089 creator A5035846349 @default.
- W4313157089 creator A5051338650 @default.
- W4313157089 creator A5060524512 @default.
- W4313157089 creator A5076704054 @default.
- W4313157089 creator A5087225139 @default.
- W4313157089 date "2022-01-01" @default.
- W4313157089 modified "2023-09-26" @default.
- W4313157089 title "From External to Internal: Structuring Image for Text-to-Image Attributes Manipulation" @default.
- W4313157089 doi "https://doi.org/10.1109/tmm.2022.3219677" @default.
- W4313157089 hasPublicationYear "2022" @default.
- W4313157089 type Work @default.
- W4313157089 citedByCount "0" @default.
- W4313157089 crossrefType "journal-article" @default.
- W4313157089 hasAuthorship W4313157089A5014087220 @default.
- W4313157089 hasAuthorship W4313157089A5035846349 @default.
- W4313157089 hasAuthorship W4313157089A5051338650 @default.
- W4313157089 hasAuthorship W4313157089A5060524512 @default.
- W4313157089 hasAuthorship W4313157089A5076704054 @default.
- W4313157089 hasAuthorship W4313157089A5087225139 @default.
- W4313157089 hasConcept C10138342 @default.
- W4313157089 hasConcept C11413529 @default.
- W4313157089 hasConcept C115961682 @default.
- W4313157089 hasConcept C154945302 @default.
- W4313157089 hasConcept C162324750 @default.
- W4313157089 hasConcept C204321447 @default.
- W4313157089 hasConcept C23123220 @default.
- W4313157089 hasConcept C2775945657 @default.
- W4313157089 hasConcept C2779231336 @default.
- W4313157089 hasConcept C41008148 @default.
- W4313157089 hasConceptScore W4313157089C10138342 @default.
- W4313157089 hasConceptScore W4313157089C11413529 @default.
- W4313157089 hasConceptScore W4313157089C115961682 @default.
- W4313157089 hasConceptScore W4313157089C154945302 @default.
- W4313157089 hasConceptScore W4313157089C162324750 @default.
- W4313157089 hasConceptScore W4313157089C204321447 @default.
- W4313157089 hasConceptScore W4313157089C23123220 @default.
- W4313157089 hasConceptScore W4313157089C2775945657 @default.
- W4313157089 hasConceptScore W4313157089C2779231336 @default.
- W4313157089 hasConceptScore W4313157089C41008148 @default.
- W4313157089 hasLocation W43131570891 @default.
- W4313157089 hasOpenAccess W4313157089 @default.
- W4313157089 hasPrimaryLocation W43131570891 @default.
- W4313157089 hasRelatedWork W2001372204 @default.
- W4313157089 hasRelatedWork W2070852123 @default.
- W4313157089 hasRelatedWork W2091737559 @default.
- W4313157089 hasRelatedWork W2111106076 @default.
- W4313157089 hasRelatedWork W2600707098 @default.
- W4313157089 hasRelatedWork W2983439719 @default.
- W4313157089 hasRelatedWork W3107474891 @default.
- W4313157089 hasRelatedWork W3131073374 @default.
- W4313157089 hasRelatedWork W4244299974 @default.
- W4313157089 hasRelatedWork W3106945349 @default.
- W4313157089 isParatext "false" @default.
- W4313157089 isRetracted "false" @default.
- W4313157089 workType "article" @default.