Complex reaction networks comprising of thousands of intermediates and reactions are ubiquitous in heterogeneous catalysis; developing mechanistic models of such systems is computationally intractable primarily due to the cost of computing the energies of reactions and species using an ab initio method such as density functional theory (DFT). Developing machine learned energy potentials trained on DFT energies offer a tractable way of building mechanistic models for these systems. To this end, the contribution of this talk is two-fold. First, we develop a novel deep neural network model to learn and predict the potential energy surface (PES) for periodic systems. The novel architecture integrates graph embedding of individual atoms, atomic interactions and periodic boundary conditions with physics-aware feature engineering, residual learning, and attention mechanism. Second, we train the model with an efficiently distributed Hessian-free optimization method and deliver a prediction within chemical accuracy, which demonstrates the potential of second order methods in the joint field of deep learning and quantum chemistry. We will use examples from the field of heterogeneous catalysis, including (i) alkane conversion on molybdenum sulfide and (ii) adsorption of hydrocarbons in zeolites.