image-processing computer-vision classification video-processing

How to classify video frames to frames containing objects and meaningless frames

I want to do some detection and classification work on video frames, however, there are too many frames in a video to be processed, so I want to find which frames contain objects and which frames are meaningless(not contain objects or faces) so that I can save some time by detecting on less frames.

I already test Gist and SVM, trying to separate images containing dogs(pascal voc) from forest scene images(15 scene dataset), but the accuracy on test data is very low(less than 50%).

Is there any other feature or algorithm suitable for this task? Also is there any data set suitable for this task?

Solution

You could look into visual saliency detection methods. If there are saliency clusters, these frames likely contain objects.

How to parametrize the contour of an elliptic BW image
Practical way of explaining "Information Theory"
How to do Image Alpha Matting in Python with a Given Trimap (Based on Closed Form Solution to Natural Image Matting)
How to use JpegTran to recursively process all images, overwriting them, in a directory using Windows?
jpegtran optimize without changing filename
How to Set Up & Use Jpegtran with Windows
How to perform OpenCV boxFilter for a ROI?
Compromise between quality and file size, how to save a very detailed image into a file with reasonable size (<1MB)?
Creating a Laplacian Matrix and Solving the Linear Equation for Image Filtering
How do I reduce a specific colour range to a single colour?
Video Editing Books
CUDA image upsampling with FFT method
Changing the Opacity of a Bitmap image
Finding CheckerBoard Points in opencv for any random ChessBoard( pattern size not known)
C++ multi-threaded data management
custom image filter
Inverting a real-valued index grid
How do I crop the solar panels captured by drone?
API to manipulate photo overlays
How to check whether a JPEG image is color or gray scale using only Python stdlib
Text Image Binary Classifier without deep learning or tesseract
OpenCV not able to detect aruco marker within image created with opencv
How to allow Chrome to access my camera on localhost?
Face Recognition Logic
Visualize an image pixel buffer with a bit depth of 3 i.e. each pixel having 3 bits of data
python - Implementing Sobel operators with python without opencv
Filter fluctuating lighting with OpenCV
Blob detection on embedded platform, memory restricted
How can I transfer data efficiently via images?
Image Processing: Algorithm Improvement for 'Coca-Cola Can' Recognition