I've had a thought. Yes, it hurt. I'm pretty sure other posters touched on it, too.
Your eyes and brain have a sort of built-in response to vibrations, knocks - basically any fast head movements. Try this: look at yourself in a mirror and focus on your eyes, then tilt your head to one side, then the other, and notice how your eyes rotate so that the "picture" stays level (up to a point; also the different height of the eyes messes with focusing and stereo imaging a little bit). We aren't conscious of this process, only that we can remarkably still make sense of the world when it's on the slant - i.e., everything still looks "level", except in the extreme. Remember also the actually remarkable ability of the eye to track objects of interest. Again in the mirror, notice that whichever way you turn your head, you always remain perfectly focused on what you're looking at - your eyes move automatically to compensate. This is limited, though, and you only have to waggle your finger in front of you to see that you'll soon lose track of fast moving images. (Notice how much easier it is to track your finger if you allow your head to move).
Now, suppose your head is absolutely stationary, but the world decides to tilt. It would play havoc with your brain (there are theme-park rides that exploit this). Imagine that you recreate a perfect 1:1 motion of your head when driving around a racing circuit in some car (e.g. accelerometers, optical methods, video recording, whatever). If you were to play that back on a screen to yourself, it would almost certainly be more difficult to "make sense of" than when you were actually in the car. Partly because it's a limited amount of visual information due to the narrow screen angle, but also because there's no physical feedback for your eye-brain to work off in order to cancel out the movements of your head relative to the movements in the images its receiving - the purely image-based "object tracking", as mentioned above (head still, image moving), is limited in its speed, so it is helped by the automatic movement of the eyes when the head is moving, too (of course, if your head is moving, the image is potentially moving more, or less, depending on the focus). To be clear, in a racing environment, your head is moving and the focus is not (as much), so the movement of your eyes is very important.
If your head isn't moving, the sum of your natural adaptations to unstable images that include the effects of head movement will function at a much reduced efficacy. The challenge, then, is to recreate the "drama" (everyone loves drama) but none of the excess disorientation that comes with the inevitable disconnect between the inner ear and the eye. I'd suggest this means a much more stable image than what you'd get from "helmet mount" recordings, but some way of conveying that movement all the same.