Neural Fields Meet Explicit Geometric Representations for Inverse Rendering of Urban Scenes