I spend a lot of time building projects in PyTorch and have not yet found a nice structure that encapsulates everything I want to do in a clean, decoupled way.
Mostly I want to decouple :
- model definition
- data management (preprocessing, dataloaders, I/O)
- training
- evaluation
- logging
My current approach would be to use a class handling all data management issues, and a class containing the network definition.
Training and evaluation would be performed by different classes, let’s call them “engines”. Those engines would have the Data and Model class as attributes, and a logging as well.
How do you structure your own project ? Are there certain caveats to having an object as an attribute of multiple different objects ?
Let’s start a discussion !