I am trying to get a better understanding of video sampling and would appreciate a little help.
First of all is the 4:4:4 or 4:2:2 notation referred to as color space?
Okay, I’m going to go on a little rant regarding what I think I know (which is very little). Please feel free to help me out here.
Three discrete values are sampled in digitizing video, luminance, and two color difference channels. Unlike digital audio, digital video requires a lot more bandwidth for two reasons. One, a video screen is two dimensional whereas audio is a one dimensional stream of vales. Two, video data requires three values for each “sample” or pixel, each audio sample only requires one amplitude value.
If the full horizontal resolution is 720 (I believe that is called D1) and the sampling is 4:4:4 then the horizontal resolution of all three channels is 720. For DV video the sampling is 4:1:1 which means the color difference signals really only have an effective resolution of 720/4 or 180 pixels. Now, I have read that since the human eye is more sensitive to the luminance the grouping of color for these four pixels isn't that big a deal. Except for color keying, of course since it NEEDS the color resolution to discern edges/boundaries.
My first question is what exactly are the two color difference channels and how is the third color determined? If you have two equations and three unknowns, you have to know something else to get enough info to find the value of the three unknowns. Do the three colors equal the value of the luminance channel?
My next question is about the actual sampling rates. Somewhere on the web I read that video is sampled at 13.5 MHz. Or, 13.5 million pixels are sampled, with three corresponding values, per second.
13.5 million samples per second divided by 59.94 fields per second equals 2.25 x 10^5 pixels that are sampled per second for NTSC video.
If there are 720 pixels across the screen, then 2.25 x 10^5 divided by 720 equals 312.81 pixels of resolution per field vertically. It seems in the video world this is called the number of scan lines.
Now, I thought that NTSC video has 525 scan lines or 262.5 scan lines per field?
Also, why is this number not whole? Do I have the sampling frequency wrong?
Any thought/comments/help would be greatly appreciated. I’m a high school science teacher (physics/chemistry) with a background in mechanical engineering so we didn’t get into this in college. I have been getting prodded to teach a digital video course (develop one, actually) and if I do it, I want a decent background in the technical side of things. I can’t stand when teachers just teach the software without going into any of the science. I think spending some time on the basics will allows the kids to really understand what the software is doing, and how to use various video software without having to read every manual.
First of all is the 4:4:4 or 4:2:2 notation referred to as color space?
Okay, I’m going to go on a little rant regarding what I think I know (which is very little). Please feel free to help me out here.
Three discrete values are sampled in digitizing video, luminance, and two color difference channels. Unlike digital audio, digital video requires a lot more bandwidth for two reasons. One, a video screen is two dimensional whereas audio is a one dimensional stream of vales. Two, video data requires three values for each “sample” or pixel, each audio sample only requires one amplitude value.
If the full horizontal resolution is 720 (I believe that is called D1) and the sampling is 4:4:4 then the horizontal resolution of all three channels is 720. For DV video the sampling is 4:1:1 which means the color difference signals really only have an effective resolution of 720/4 or 180 pixels. Now, I have read that since the human eye is more sensitive to the luminance the grouping of color for these four pixels isn't that big a deal. Except for color keying, of course since it NEEDS the color resolution to discern edges/boundaries.
My first question is what exactly are the two color difference channels and how is the third color determined? If you have two equations and three unknowns, you have to know something else to get enough info to find the value of the three unknowns. Do the three colors equal the value of the luminance channel?
My next question is about the actual sampling rates. Somewhere on the web I read that video is sampled at 13.5 MHz. Or, 13.5 million pixels are sampled, with three corresponding values, per second.
13.5 million samples per second divided by 59.94 fields per second equals 2.25 x 10^5 pixels that are sampled per second for NTSC video.
If there are 720 pixels across the screen, then 2.25 x 10^5 divided by 720 equals 312.81 pixels of resolution per field vertically. It seems in the video world this is called the number of scan lines.
Now, I thought that NTSC video has 525 scan lines or 262.5 scan lines per field?
Also, why is this number not whole? Do I have the sampling frequency wrong?
Any thought/comments/help would be greatly appreciated. I’m a high school science teacher (physics/chemistry) with a background in mechanical engineering so we didn’t get into this in college. I have been getting prodded to teach a digital video course (develop one, actually) and if I do it, I want a decent background in the technical side of things. I can’t stand when teachers just teach the software without going into any of the science. I think spending some time on the basics will allows the kids to really understand what the software is doing, and how to use various video software without having to read every manual.
Comment