OK, let e try to answer…
First, film has grain; as Ruminator pointed out, and which I mentioned in my one-line reply. Film is a strip of clear material with an emulsion on it. The emulsion is sensitive to light. When it is exposed, it contains a ‘latent image’. Upon processing, the latent image becomes visible. The emulsion may produce colour or black-and-white images, depending on its composition. The thing about the emulsion is that it’s made up of small particles that are uniformly but randomly dispersed on the base. That is, a particle may exist at coordinates X, Y on one frame, but an identical particle isn’t in exactly the same position on the next frame. Multiply this millions of times (or however many times) across the frame, and the grains will seem to ‘swim around’ a bit. Larger grains are more sensitive to light than smaller ones, so ‘fast’ film will appear more grainy than ‘slow’ film and there will be more apparent movement or ‘noise’ (to use a video term).
Video pixels are arranged in a fixed matrix. Since there is no ‘grain’ in the film sense, you don’t get the ‘swimming’ effect on the picture. This, I think, gives video a ‘harder’ look than film.
‘Latitude’ is the degree to which film can be over- or under-exposed and still get a good image. Film has greater latitude than video, so you can get richer contrasts with film.
For any given lens, there is only one plane at which the subject is precisely in focus on the recording medium (film or electronic device). However the focus falls off gradually; so there is a ‘range’ within which the image appears to be in focus. This is called Depth of Field. When making an image, light is controlled to produce a properly-exposed image. Given a recording medium with a set sensitivity (e.g., film at a certain ASA/EI rating) light can be controlled by opening or closing the aperture, increasing or decreasing the exposure time by varying the shutter speed, varying the shutter angle, or increasing or decreasing the amount of light through the addition or subtraction of filters and/or lights. It’s the aperture that controls Depth of Field. A small aperture provides a deeper DoF, and a large aperture provides a shallower one.
In my opinion lenses for video cameras tend to be ‘not as good’ as film lenses – at least for consumer and prosumer cameras. A typical video zoom lens might cost a few hundred to a few thousand dollars. A typical zoom lens for a 16mm camera might cost ten times that. A set of used Zeiss Superspeed prime lenses (i.e., say, five non-zoom lenses) will set you back about $20,000 or more. There’s a company called Redrock Micro that makes a unit that allows prosumer-level (and pro-level) videographers use 35mm cinema lenses on today’s small production video cameras. While the aperture controls DoF for a given lens, the choice of lens also affects DoF. Basically, a longer focal length, the shallower the DoF; and the larger the recording medium, the shorter the DoF. Since the ‘chips’ in a video camera are smaller than a frame of 35mm film, video cameras tend to have a deeper DoF. Many videographers are now starting to use the Redrock unit (or similar one) to capture images with a shallower, more ‘film-like’ Dof. But not everyone does.
So the difference I (personally) see are that film has a ‘live’ quality to it due to the grain of the emulsion, film has greater latitude (and contrast) than video, and DoF seems to be more controllable on film than on video.
YMMV.