It took Krizhevsky et al. five to six days to train their network on the best hardware available in 2012.