dc.contributor.advisor | Hochreiter, Sepp | |
dc.contributor.author | Samwald, Christian | |
dc.date.accessioned | 2024-03-12T11:35:36Z | |
dc.date.available | 2024-03-12T11:35:36Z | |
dc.date.issued | 2021 | |
dc.date.submitted | 2021-03-22 | |
dc.identifier.uri | https://dspace.jcu.cz/handle/20.500.14390/44822 | |
dc.description.abstract | Delayed rewards are detrimental to the learning of reinforcement learning agents.One approach to this problem is the usage of return decomposition and rewardredistribution. It was realised in the Align-RUDDER algorithm of Patilet al.[14].Their solution employed the multiple sequence alignment algorithm Clustal W. Iintegrated the sequence alignment Tool Clustal, Clustal W's successor, intoAlign RUDDER to increase efficiency. During the testing process, the usage ofClustal's EPA function and the effects of different sample sizes played a centralrole. The data set that was used came from the MineRL data set [6]. | cze |
dc.format | 40 p. | |
dc.format | 40 p. | |
dc.language.iso | eng | |
dc.publisher | Jihočeská univerzita | cze |
dc.rights | Bez omezení | |
dc.subject | Clustal | cze |
dc.subject | Reinforcement Learning | cze |
dc.subject | Bioinformatics | cze |
dc.subject | Sequence alignment | cze |
dc.subject | Multiple sequence Alignment | cze |
dc.subject | Align-RUDDER | cze |
dc.subject | Clustal | eng |
dc.subject | Reinforcement Learning | eng |
dc.subject | Bioinformatics | eng |
dc.subject | Sequence alignment | eng |
dc.subject | Multiple sequence Alignment | eng |
dc.subject | Align-RUDDER | eng |
dc.title | Effects of hyperparameters in multiple sequence alignment for Align-RUDDER using Clustal | cze |
dc.title.alternative | Effects of hyperparameters in multiple sequence alignment for Align-RUDDER using Clustal | eng |
dc.type | bakalářská práce | cze |
dc.identifier.stag | 63078 | |
dc.description.abstract-translated | Delayed rewards are detrimental to the learning of reinforcement learning agents.One approach to this problem is the usage of return decomposition and rewardredistribution. It was realised in the Align-RUDDER algorithm of Patilet al.[14].Their solution employed the multiple sequence alignment algorithm Clustal W. Iintegrated the sequence alignment Tool Clustal, Clustal W's successor, intoAlign RUDDER to increase efficiency. During the testing process, the usage ofClustal's EPA function and the effects of different sample sizes played a centralrole. The data set that was used came from the MineRL data set [6]. | eng |
dc.date.accepted | 2021-03-24 | |
dc.description.department | Přírodovědecká fakulta | cze |
dc.thesis.degree-discipline | Bioinformatics | cze |
dc.thesis.degree-grantor | Jihočeská univerzita. Přírodovědecká fakulta | cze |
dc.thesis.degree-name | Bc. | |
dc.thesis.degree-program | Applied Informatics | cze |
dc.description.grade | Dokončená práce s úspěšnou obhajobou | cze |
dc.contributor.referee | Hofmarcher, Markus | |