National Repository of Grey Literature 5 records found  Search took 0.01 seconds. 
Learning the Face Behind a Voice
Zubalík, Petr ; Mošner, Ladislav (referee) ; Plchot, Oldřich (advisor)
The main goal of this thesis is to design and implement a system that will be able to generate a face based on the speech of a given person. This problem is solved using a system composed of three convolutional neural network models. The first one is based on the ResNet architecture and is used to extract features from speech recordings. The second model is a fully convolutional neural network which converts the extracted features into the styles which form a base for the final facial image. These styles are then passed as an input to the StyleGAN generator, which creates the resulting face. The proposed system is implemented in the Python programming language using the PyTorch framework. The last chapter of the thesis discusses some of the most significant experiments performed to fine-tune and test the developed system.
Increasing quality of facial images using sequence of images
Svorad, Adam ; Mezina, Anzhelika (referee) ; Burget, Radim (advisor)
Diplomova praca sa zameriava na oblast zaostrovania obrazkov tvari. V teoretickej casti prace budu prezentovane moderne metody zaostrovania obrazkov pomocou jedineho obrazku a metody editacie obrazkov. Prakticka cast sa zameria na pristupy rekonstrukcie obrazkov zo sekvencie poskodenych obrazkov. Viacere modely neuronovych sieti so vstupom pre viacero obrazkov budu zhotovene a vyhodnotene. Alternativny pristup v podobe balika nastrojov na editaciu obrazkov bude taktiez predstaveny. Tieto nastroje budu vyuzivat najmodernejsie pristupy k editacii obrazkov s cielom spojit vizualne prvky tvari zo vstupnej sekvencie obrazkov do jedneho finalneho vystupu. V zavere prace budu vsetky metody navzajom porovnane.
GAN Generated Data for CNN Age Estimation
Venkrbec, Tomáš ; Herout, Adam (referee) ; Hradiš, Michal (advisor)
The goal of this thesis is to implement one of the state-of-the-art methods of generative adversarial networks and to propose its extension to conditional generation. This has been used to generate photorealistic images of human faces with specified characteristics such as age and gender. For this purpose, a highly diverse dataset of over 230,000 samples was created by merging and cleaning existing annotated face datasets. All ages, genders and different ethnic groups are well represented in it. StyleGAN2 generator trained on this dataset achieved a FID of 7.14. The synthetic data ratio was then experimented with during age classifier training. For the test subset of the dataset, the addition of synthetic data achieved a reduction in the mean absolute error from 3.499 years to 3.294 years. For the independent test dataset, a reduction in mean error from 4.012 years to 3.875 years was achieved.
Learning the Face Behind a Voice
Zubalík, Petr ; Mošner, Ladislav (referee) ; Plchot, Oldřich (advisor)
The main goal of this thesis is to design and implement a system that will be able to generate a face based on the speech of a given person. This problem is solved using a system composed of three convolutional neural network models. The first one is based on the ResNet architecture and is used to extract features from speech recordings. The second model is a fully convolutional neural network which converts the extracted features into the styles which form a base for the final facial image. These styles are then passed as an input to the StyleGAN generator, which creates the resulting face. The proposed system is implemented in the Python programming language using the PyTorch framework. The last chapter of the thesis discusses some of the most significant experiments performed to fine-tune and test the developed system.
Increasing quality of facial images using sequence of images
Svorad, Adam ; Mezina, Anzhelika (referee) ; Burget, Radim (advisor)
Diplomova praca sa zameriava na oblast zaostrovania obrazkov tvari. V teoretickej casti prace budu prezentovane moderne metody zaostrovania obrazkov pomocou jedineho obrazku a metody editacie obrazkov. Prakticka cast sa zameria na pristupy rekonstrukcie obrazkov zo sekvencie poskodenych obrazkov. Viacere modely neuronovych sieti so vstupom pre viacero obrazkov budu zhotovene a vyhodnotene. Alternativny pristup v podobe balika nastrojov na editaciu obrazkov bude taktiez predstaveny. Tieto nastroje budu vyuzivat najmodernejsie pristupy k editacii obrazkov s cielom spojit vizualne prvky tvari zo vstupnej sekvencie obrazkov do jedneho finalneho vystupu. V zavere prace budu vsetky metody navzajom porovnane.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.