@User there's a pretty clear explanation of the capacitance model in the BSIM user manual. It's just a set of partial differentiation between charge into a node relative to a voltage change on a different node.
If you consider your example of Cgd and Cdg that's a Miller capacitance that looks different depending on which 'direction' you look at due to unitary voltage gain