Multi-scale persistent spatiotemporal transformer for long-term urban traffic flow prediction