Skip to content
GitLab
Explore
Sign in
create a model according to metagrok's design
parse big ass obs vector passed from ENV
embed all the one-hots
output both Policy and Value -> for pfrl agents to handle and learn