National Repository of Grey Literature 3 records found  Search took 0.01 seconds. 
Generative Neural Network for Creating Synthetic Photorealistic Images
Hora, Adam ; Přinosil, Jiří (referee) ; Říha, Kamil (advisor)
The main objective of this work is to select and design a neural network model that will be able to generate realistic images thematically fitting the selected dataset. The architecture used for the solution is Deep convolutional generative adversarial network. This network is than implemented in the Python programming language using the Tensorflow application programming interface and its included interface Keras. Finally, the model is trained on the selected dataset and the resulting generated images are presented. The final model and individual images are then evaluated using various quality assessment methods.
Learning the Face Behind a Voice
Zubalík, Petr ; Mošner, Ladislav (referee) ; Plchot, Oldřich (advisor)
The main goal of this thesis is to design and implement a system that will be able to generate a face based on the speech of a given person. This problem is solved using a system composed of three convolutional neural network models. The first one is based on the ResNet architecture and is used to extract features from speech recordings. The second model is a fully convolutional neural network which converts the extracted features into the styles which form a base for the final facial image. These styles are then passed as an input to the StyleGAN generator, which creates the resulting face. The proposed system is implemented in the Python programming language using the PyTorch framework. The last chapter of the thesis discusses some of the most significant experiments performed to fine-tune and test the developed system.
Learning the Face Behind a Voice
Zubalík, Petr ; Mošner, Ladislav (referee) ; Plchot, Oldřich (advisor)
The main goal of this thesis is to design and implement a system that will be able to generate a face based on the speech of a given person. This problem is solved using a system composed of three convolutional neural network models. The first one is based on the ResNet architecture and is used to extract features from speech recordings. The second model is a fully convolutional neural network which converts the extracted features into the styles which form a base for the final facial image. These styles are then passed as an input to the StyleGAN generator, which creates the resulting face. The proposed system is implemented in the Python programming language using the PyTorch framework. The last chapter of the thesis discusses some of the most significant experiments performed to fine-tune and test the developed system.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.