FgSegNet_v2: "Learning Multi-scale Features for Foreground Segmentation" by Long Ang LIM and Hacer YALIM KELES

Home Page: https://arxiv.org/abs/1808.01477

License: Other

Python 70.43% Jupyter Notebook 20.65% C++ 8.68% Makefile 0.24%

foreground-detection foreground-segmentation-network fgsegnet background-subtraction feature-pooling-module fpm-module video-surveillance

fgsegnet_v2's Introduction

FgSegNet_v2 : Foreground Segmentation Network version 2

This repository contains the source code and training sets for the following paper:

"Learning multi-scale features for foreground segmentation." by Long Ang LIM and Hacer YALIM KELES

Published at Pattern Analysis and Applications
The preprint version is available at: https://arxiv.org/abs/1808.01477

Citation

If you find FgSegNet_v2 useful in your research, please consider citing:

Lim, L.A. & Keles, H.Y. Pattern Anal Applic (2019). https://doi.org/10.1007/s10044-019-00845-9

Preprint:

@article{lim2018learning,
	  title={Learning Multi-scale Features for Foreground Segmentation},
	  author={Lim, Long Ang and Keles, Hacer Yalim},
	  journal={arXiv preprint arXiv:1808.01477},
	  year={2018}
}

Requirements

This work was implemented with the following frameworks:

Spyder 3.2.x (recommended)
Python 3.6.3
Keras 2.0.6
Tensorflow-gpu 1.1.0

Usage

Clone this repo: git clone https://github.com/lim-anggun/FgSegNet_v2.git

Download CDnet2014, SBI2015 and UCSD datasets, then put them in the following directory structure:

Example:

 FgSegNet_v2/
      scripts/FgSegNet_v2_CDnet.py
             /FgSegNet_v2_SBI.py
             /FgSegNet_v2_UCSD.py
             /FgSegNet_v2_module.py
             /instance_normalization.py
             /my_upsampling_2d.py
	     /prediction_example.ipynb
	     
      datasets/
              /CDnet2014_dataset/...
              /SBI2015_dataset/...
              /UCSD_dataset/...
	  
      training_sets/
                   /CDnet2014_train/...
                   /SBI2015_train/...
                   /UCSD_train20/...
                   /UCSD_train50/...
	       
  testing_scripts/extract_mask.py
  		 /thresholding.py
		 /python_metrics/...

Run the scripts with the Spyder IDE. Note that all trained models are saved automatically in the current working directory.
To extract foreground masks, suppose your files are stored in the following directory structure:

     # Script file in the root dir
     extract_mask.py
     
     # Dataset downloaded from changedetection.net
     CDnet2014_dataset/baseline/...
     		      /cameraJitter/...
		      /badWeather/...
	
     # your trained model dir (models25 = models trained with 25 frames; likewise models50 and models200)
     FgSegNet_v2/models25/baseline/mdl_highway.h5
     				  /mdl_pedestrians.h5
				  ...
		         /cameraJitter/mdl_badminton.h5
			 	      /mdl_traffic.h5
				      /...
			/...

Open a Windows command prompt and run:

> python extract_mask.py

Your extracted frames will be automatically stored in FgSegNet_v2/results25/[CATEGORY_NAME]/[SCENE_NAME]/[binXXXXXX.png, ...]

Threshold your foreground masks. Assuming your extracted frames are stored in the folders above, open a command prompt and run:

> python thresholding.py

Your thresholded frames will be automatically stored in FgSegNet_v2/results25_th[0.X]/[CATEGORY_NAME]/[SCENE_NAME]/[binXXXXXX.png, ...]
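
For readers who want to see what the thresholding step amounts to, here is a minimal sketch. It is not the code in thresholding.py: the function names, the `>=` comparison, the 0/255 output values and the exact folder-name pattern (read from `results25_th[0.X]` above) are assumptions made for illustration.

```python
def threshold_mask(prob_mask, threshold=0.5):
    """Binarize a probability mask: 255 = foreground, 0 = background (assumed encoding)."""
    return [[255 if p >= threshold else 0 for p in row] for row in prob_mask]


def thresholded_dir(results_dir, threshold):
    """Derive an output folder name such as 'results25_th0.6' (assumed naming pattern)."""
    return "{}_th{}".format(results_dir.rstrip("/\\"), threshold)


# Toy 2x2 "mask" of foreground probabilities.
probs = [[0.10, 0.65],
         [0.92, 0.40]]
print(threshold_mask(probs, threshold=0.6))
print(thresholded_dir("FgSegNet_v2/results25", 0.6))
```

In practice the masks would be image arrays rather than nested lists; the comparison against a single threshold is the part this sketch is meant to show.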

Remove training frames from your thresholded frames and evaluate your results.
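
One possible way to remove the training frames is sketched below. It assumes that training files and result masks share a six-digit frame index in their file names (as in binXXXXXX.png); the helper names and the `dry_run` switch are hypothetical and not part of the provided scripts.

```python
import os
import re

FRAME_ID = re.compile(r"(\d{6})")  # six-digit frame index, as in binXXXXXX.png


def frame_ids(folder):
    """Collect the six-digit frame indices found in a folder's file names."""
    ids = set()
    for name in os.listdir(folder):
        match = FRAME_ID.search(name)
        if match:
            ids.add(match.group(1))
    return ids


def remove_training_frames(train_dir, result_dir, dry_run=True):
    """Return masks whose index was used for training; delete them unless dry_run."""
    train_ids = frame_ids(train_dir)
    removed = []
    for name in sorted(os.listdir(result_dir)):
        match = FRAME_ID.search(name)
        if match and match.group(1) in train_ids:
            removed.append(name)
            if not dry_run:
                os.remove(os.path.join(result_dir, name))
    return removed
```

Running it once with `dry_run=True` and inspecting the returned list before deleting anything is the safer order.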

Evaluation

We evaluate our method on three different datasets, as described here or here.

e.g.

> cd python_metrics
> python processFolder.py dataset_path root_path_of_thresholded_frames

Results

Results on CDnet2014 dataset

Table below shows overall results across 11 categories obtained from Change Detection 2014 Challenge.

Methods	PWC	F-Measure	Speed (320x240, batch-size=1) on NVIDIA GTX 970 GPU
FgSegNet_v2	0.0402	0.9847	23fps
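
For reference, PWC (percentage of wrong classifications) and F-Measure are computed from the pixel-level confusion matrix using the standard change-detection definitions. The sketch below is only an illustration of those formulas, not the evaluation code in python_metrics; the function name and the zero-division guards are my own.

```python
def cdnet_metrics(tp, fp, fn, tn):
    """Compute recall, precision, F-Measure and PWC from pixel counts."""
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    if precision + recall:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0
    # PWC is expressed in percent of all evaluated pixels.
    pwc = 100.0 * (fn + fp) / (tp + fn + fp + tn)
    return {"recall": recall, "precision": precision,
            "f_measure": f_measure, "pwc": pwc}


# Toy counts: 1000 evaluated pixels, 20 of them misclassified.
print(cdnet_metrics(tp=90, fp=10, fn=10, tn=890))
```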

Results on SBI2015 dataset

Table below shows overall test results across 14 video sequences.

Methods	PWC	F-Measure
FgSegNet_v2	0.7148	0.9853

Results on UCSD Background Subtraction dataset

Table below shows overall test results across 18 video sequences.

Methods	PWC (20% split)	F-Measure (20% split)	PWC (50% split)	F-Measure (50% split)
FgSegNet_v2	0.6136	0.8945	0.4405	0.9203

Updates

09/11/2019:

add testing scripts as requested

07/08/2018:

add FgSegNet_v2 source code and training frames

04/02/2019:

add a jupyter notebook & a YouTube video

Contact

lim.longang at gmail.com
Any issues/discussions are welcome.

fgsegnet_v2's Issues

ModuleNotFoundError 'keras.engine.base_layer'

Hi, thanks for your work!
I followed the instructions to run this project. After setting up the environment, I ran "python3 FgSegNet_v2_CDnet.py" but got the error below:
Traceback (most recent call last):
  File "FgSegNet_v2_CDnet.py", line 37, in <module>
    from FgSegNet_v2_module import FgSegNet_v2_module
  File "/home/jhd/face_recognition/softwares/FgSegNet_v2/scripts/FgSegNet_v2_module.py", line 15, in <module>
    from my_upsampling_2d import MyUpSampling2D
  File "/home/jhd/face_recognition/softwares/FgSegNet_v2/scripts/my_upsampling_2d.py", line 13, in <module>
    from keras.engine.base_layer import InputSpec
ModuleNotFoundError: No module named 'keras.engine.base_layer'

Is there something I missed?
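
A possible workaround, offered as a sketch rather than a confirmed fix: the traceback says the installed Keras has no `keras.engine.base_layer` module, and as far as I know `InputSpec` has lived in different modules across Keras releases. The helper below tries several candidate locations in turn. The helper name, the candidate list and its order are assumptions; only the module named in the traceback comes from the repository.

```python
import importlib


def import_first(module_names, attr):
    """Return `attr` from the first importable module in `module_names` that has it."""
    for name in module_names:
        try:
            module = importlib.import_module(name)
        except ImportError:
            continue  # module not available in this installation, try the next one
        if hasattr(module, attr):
            return getattr(module, attr)
    raise ImportError("{} not found in any of: {}".format(attr, ", ".join(module_names)))


# Candidate locations of InputSpec (assumed; adjust to your Keras version).
KERAS_INPUTSPEC_MODULES = [
    "keras.engine.base_layer",
    "keras.engine.topology",
    "keras.engine",
    "keras.layers",
]

# In my_upsampling_2d.py the failing import could then become:
# InputSpec = import_first(KERAS_INPUTSPEC_MODULES, "InputSpec")
```

Installing the Keras version the script was written against is the other obvious route; the helper only avoids editing the import by hand for each version.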

The accuracy of one category is very low: port_0_17fps : 0.435026037734113

It is very strange: most scenes reach 98% or 99%, and a few reach 96%,
but port_0_17fps gets only 0.435026037734113.

CDnet Utilities link address is missing

Hi! The CDnet Utilities link cannot be opened. Can you post it again? Thanks!

Design of decoder

Dear author, thanks for your paper on change detection. I'm very interested in the design of the decoder in your architecture. However, I am curious why you combine information as alpha*f + f, where alpha is the average pooling of information from the upper layer. Many papers in the semantic segmentation field instead use a U-Net-like design.

About the meaning of void_label

In the scripts, for example in the loss function, I see void_label = -1.
I am confused: does -1 mean background or something else?
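
My reading, to be checked against the loss in FgSegNet_v2_module.py, is that -1 is neither background nor foreground: it marks pixels that are excluded from the loss (for example pixels outside the region of interest). The pure-Python sketch below illustrates that idea on flat lists; the function name, the `eps` clipping and the list-based inputs are assumptions, and the real loss operates on tensors.

```python
import math

VOID_LABEL = -1.0  # value taken from the scripts; its meaning here is my interpretation


def masked_binary_crossentropy(y_true, y_pred, void_label=VOID_LABEL, eps=1e-7):
    """Mean binary cross-entropy over pixels whose label is not void_label."""
    total, count = 0.0, 0
    for t, p in zip(y_true, y_pred):
        if t == void_label:
            continue  # ignored pixel: contributes nothing to the loss
        p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
        total += -(t * math.log(p) + (1.0 - t) * math.log(1.0 - p))
        count += 1
    return total / count if count else 0.0


# The prediction at the void pixel (third entry) has no effect on the result.
print(masked_binary_crossentropy([1.0, 0.0, -1.0], [0.9, 0.1, 0.5]))
print(masked_binary_crossentropy([1.0, 0.0, -1.0], [0.9, 0.1, 0.99]))
```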

unable to load the model after adding instance_normalization

@lim-anggun I have read your previous comment about adding instance_normalization.py, but I am still getting the error.
I have added these lines of code as you suggested:
from FgSegNet.instance_normalization import InstanceNormalization
model = load_model(mdl_path, custom_objects={'MyUpSampling2D': MyUpSampling2D, 'InstanceNormalization': InstanceNormalization})

Error :

-----------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-26-69f03b7b76fd> in <module>
      4 mdl_path = 'FgSegNet_M/CDnet/models50/baseline/mdl_pedestrians.h5'
      5 from FgSegNet.instance_normalization import InstanceNormalization
----> 6 model = load_model(mdl_path, custom_objects={'MyUpSampling2D': MyUpSampling2D, 'InstanceNormalization': InstanceNormalization})
      7 #from FgSegNet_v2_module.py import loss2, acc2
      8 #model = load_model(mdl_path, custom_objects={'MyUpSampling2D': MyUpSampling2D, 'InstanceNormalization': InstanceNormalization})

~/anaconda3/envs/p3/lib/python3.6/site-packages/keras/models.py in load_model(filepath, custom_objects, compile)
    262                       metrics=metrics,
    263                       loss_weights=loss_weights,
--> 264                       sample_weight_mode=sample_weight_mode)
    265 
    266         # Set optimizer weights.

~/anaconda3/envs/p3/lib/python3.6/site-packages/keras/engine/training.py in compile(self, optimizer, loss, metrics, loss_weights, sample_weight_mode, **kwargs)
    679             loss_functions = [losses.get(l) for l in loss]
    680         else:
--> 681             loss_function = losses.get(loss)
    682             loss_functions = [loss_function for _ in range(len(self.outputs))]
    683         self.loss_functions = loss_functions

~/anaconda3/envs/p3/lib/python3.6/site-packages/keras/losses.py in get(identifier)
    100     if isinstance(identifier, six.string_types):
    101         identifier = str(identifier)
--> 102         return deserialize(identifier)
    103     elif callable(identifier):
    104         return identifier

~/anaconda3/envs/p3/lib/python3.6/site-packages/keras/losses.py in deserialize(name, custom_objects)
     92                                     module_objects=globals(),
     93                                     custom_objects=custom_objects,
---> 94                                     printable_module_name='loss function')
     95 
     96 

~/anaconda3/envs/p3/lib/python3.6/site-packages/keras/utils/generic_utils.py in deserialize_keras_object(identifier, module_objects, custom_objects, printable_module_name)
    157             if fn is None:
    158                 raise ValueError('Unknown ' + printable_module_name +
--> 159                                  ':' + function_name)
    160         return fn
    161     else:

ValueError: Unknown loss function:loss

I would appreciate your advice on this. Thank you.
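
A suggestion, not verified against this exact setup: the traceback fails inside `compile`, when Keras tries to look up the training loss by the name "loss". If the model is only needed for prediction, passing `compile=False` (a parameter visible in the `load_model` signature in the traceback) should skip that lookup. The wrapper below is a hypothetical helper; the `loader` argument exists only so that the sketch can be exercised without Keras installed.

```python
def load_for_inference(mdl_path, custom_layers, loader=None):
    """Load a saved model without recompiling it, so the custom loss is never looked up."""
    if loader is None:
        # Imported lazily so that this sketch can be read and tested without Keras.
        from keras.models import load_model
        loader = load_model
    return loader(mdl_path, custom_objects=custom_layers, compile=False)


# Intended usage (requires Keras and the repository's scripts on the path):
# from my_upsampling_2d import MyUpSampling2D
# from instance_normalization import InstanceNormalization
# model = load_for_inference(
#     mdl_path,
#     {'MyUpSampling2D': MyUpSampling2D, 'InstanceNormalization': InstanceNormalization},
# )
```

If the model has to be compiled (for further training, say), the alternative would be to add the original loss function to `custom_objects` under the key 'loss', which is the name the error message reports.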

How to handle real video

Thank you for your work,
I have a question. I see that the code trains a separate model for each test scene. Now I have a real video (not in the dataset) with no ground truth. How should I extract the foreground?

Can it work well on other dataset?

I want to use it in some real scenes. Can this network work well on other datasets, such as VOC? Can it be trained well on non-video datasets?

License File

Can you please add a license file to clarify the usage limits of your code?

How can I train with my own video sequence?

As far as I know, if I want to train with my own video sequence, I have to configure FgSegNetModule.py manually.
But I'm a newbie to Keras and even to deep learning.
I found that I should modify the code below to fit my input video:

if dataset_name=='CDnet':
            if(self.scene=='tramCrossroad_1fps'):
                x = MyUpSampling2D(size=(1,1), num_pixels=(2,0), method_name=self.method_name)(x)
            elif(self.scene=='bridgeEntry'):
                x = MyUpSampling2D(size=(1,1), num_pixels=(2,2), method_name=self.method_name)(x)
            elif(self.scene=='fluidHighway'):
                x = MyUpSampling2D(size=(1,1), num_pixels=(2,0), method_name=self.method_name)(x)
            elif(self.scene=='streetCornerAtNight'): 
                x = MyUpSampling2D(size=(1,1), num_pixels=(1,0), method_name=self.method_name)(x)
                x = Cropping2D(cropping=((0, 0),(0, 1)))(x)
            elif(self.scene=='tramStation'):  
                x = Cropping2D(cropping=((1, 0),(0, 0)))(x)
            elif(self.scene=='twoPositionPTZCam'):
                x = MyUpSampling2D(size=(1,1), num_pixels=(0,2), method_name=self.method_name)(x)
            elif(self.scene=='turbulence2'):
                x = Cropping2D(cropping=((1, 0),(0, 0)))(x)
                x = MyUpSampling2D(size=(1,1), num_pixels=(0,1), method_name=self.method_name)(x)
            elif(self.scene=='turbulence3'):
                x = MyUpSampling2D(size=(1,1), num_pixels=(2,0), method_name=self.method_name)(x)

But I don't know what num_pixels I should pass to it.
How can I find the num_pixels that corresponds to my video sequence? And in what situation should I use Cropping2D()?
Is there anything else I should modify?
Thank you very much for replying.
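
A tentative answer based only on the snippet above: `num_pixels` appears to be the number of (rows, columns) added to the decoder output, and `Cropping2D` removes rows or columns, so that the mask ends up with exactly the frame size. Under that assumption, one way to find the values is to build the model without any adjustment, read its output height and width, and compare them with the frame size. The helper below is hypothetical and the sizes in the example are made up.

```python
def size_adjustment(input_hw, output_hw):
    """Compare the frame size with the decoder output size.

    Returns (num_pixels, cropping):
      num_pixels = (rows, cols) that are missing and would have to be added,
      cropping   = (rows, cols) that are in excess and would have to be cut.
    """
    add, cut = [], []
    for target, actual in zip(input_hw, output_hw):
        diff = target - actual
        add.append(max(diff, 0))
        cut.append(max(-diff, 0))
    return tuple(add), tuple(cut)


# Frame is 350x640 but the decoder produces 348x640: two rows are missing.
print(size_adjustment((350, 640), (348, 640)))
# Frame is 295x480 but the decoder produces 296x480: one row is in excess.
print(size_adjustment((295, 480), (296, 480)))
```

Which side the excess rows or columns are cropped from is a free choice in `Cropping2D(cropping=((top, bottom), (left, right)))`; when the sizes already match, neither layer should be needed.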

foreground segment images

How can I get the foreground segment images?

ImportError: libcublas.so.8.0: cannot open shared object file: No such file or directory

There is a dependency issue: TensorFlow 1.1.0 is deprecated. I am currently using Ubuntu 20.04, and CUDA 8 is not supported on Ubuntu 20.04. Can you please suggest a way around this error?

multispectral use

Hello, I am writing to ask whether this model supports 3 bands (RGB) only. If multispectral images are given as input, will the model automatically process all channels or just the first 3 bands? Thank you.

Memory leak and how to train using a gpu ?

How can I enable GPU training? I'm trying to train on Colab, but it always goes beyond the 12 GB RAM limit. Is that normal?
I'm using Keras 2.0.6 and TensorFlow 1.1.0 as advised, but the training process is extremely slow and only works with batch_size=1; otherwise it runs out of RAM.

Thanks for your help.
@lim-anggun

compilation error

The log is as follows:

Using TensorFlow backend.

->>> intermittentObjectMotion / abandonedBox
Traceback (most recent call last):
File "extract_mask.py", line 205, in
img = kImage.load_img(ROI_file, grayscale=True)
File "/usr/local/lib/python3.5/dist-packages/keras/preprocessing/image.py", line 322, in load_img
img = pil_image.open(path)
File "/usr/local/lib/python3.5/dist-packages/PIL/Image.py", line 2878, in open
fp = builtins.open(filename, "rb")
FileNotFoundError: [Errno 2] No such file or directory: 'CDnet2014_dataset/intermittentObjectMotion/abandonedBox/ROI.bmp'

How can i solve this error?
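
The traceback shows that extract_mask.py expects a ROI.bmp file inside each scene folder, resolved relative to the current working directory. A small pre-flight check like the one below can list the scenes where that file is absent. This is a hypothetical helper, not part of the repository; it assumes the `category/scene` layout shown in the path of the error message.

```python
import os


def scenes_missing_file(dataset_dir, required="ROI.bmp"):
    """List 'category/scene' folders under dataset_dir that lack `required`."""
    missing = []
    for category in sorted(os.listdir(dataset_dir)):
        cat_path = os.path.join(dataset_dir, category)
        if not os.path.isdir(cat_path):
            continue
        for scene in sorted(os.listdir(cat_path)):
            scene_path = os.path.join(cat_path, scene)
            if not os.path.isdir(scene_path):
                continue
            if not os.path.isfile(os.path.join(scene_path, required)):
                missing.append(category + "/" + scene)
    return missing


# Example (path as used in the error message):
# print(scenes_missing_file("CDnet2014_dataset"))
```

If every scene is reported as missing, the dataset folder is probably not where the script is being run from; if only some are, those scene folders may be incomplete downloads.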

->>> baseline / highway
Traceback (most recent call last):

File "", line 1, in
runfile('D:/ShiYanchnegxu/FgSegNet_v2-master/testing_scripts/extract_mask.py', wdir='D:/ShiYanchnegxu/FgSegNet_v2-master/testing_scripts')

File "D:\ProgramData\Anaconda3\lib\site-packages\spyder\utils\site\sitecustomize.py", line 705, in runfile
execfile(filename, namespace)

File "D:\ProgramData\Anaconda3\lib\site-packages\spyder\utils\site\sitecustomize.py", line 102, in execfile
exec(compile(f.read(), filename, 'exec'), namespace)

File "D:/ShiYanchnegxu/FgSegNet_v2-master/testing_scripts/extract_mask.py", line 239, in
model = load_model(mdl_path)

File "D:\ProgramData\Anaconda3\envs\TF1.1.0\lib\site-packages\keras\models.py", line 227, in load_model
with h5py.File(filepath, mode='r') as f:

File "D:\ProgramData\Anaconda3\lib\site-packages\h5py_hl\files.py", line 269, in init
fid = make_fid(name, mode, userblock_size, fapl, swmr=swmr)

File "D:\ProgramData\Anaconda3\lib\site-packages\h5py_hl\files.py", line 99, in make_fid
fid = h5f.open(name, flags, fapl=fapl)

File "h5py_objects.pyx", line 54, in h5py._objects.with_phil.wrapper

File "h5py_objects.pyx", line 55, in h5py._objects.with_phil.wrapper

File "h5py\h5f.pyx", line 78, in h5py.h5f.open

OSError: Unable to open file (unable to open file: name = 'FgSegNet_v2\models25\baseline\mdl_highway.h5', errno = 2, error message = 'No such file or directory', flags = 0, o_flags = 0)
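
A likely cause, judging from the log: the script is run with wdir set to testing_scripts, and the model path 'FgSegNet_v2\models25\baseline\mdl_highway.h5' is relative, so it is looked up inside testing_scripts. The sketch below builds the path from an explicit base directory and reports whether the file exists there. The helper names and the `base_dir` parameter are hypothetical; the folder and file naming follows the usage section of the README.

```python
import os


def resolve_from(base_dir, *parts):
    """Build an absolute path anchored at base_dir instead of the current working directory."""
    return os.path.abspath(os.path.join(base_dir, *parts))


def describe_model_path(base_dir, category, scene, models_dir="models25"):
    """Return (absolute model path, whether the file exists)."""
    path = resolve_from(base_dir, "FgSegNet_v2", models_dir,
                        category, "mdl_" + scene + ".h5")
    return path, os.path.isfile(path)


# Example: check where the highway model is expected relative to the working directory.
# print(describe_model_path(os.getcwd(), "baseline", "highway"))
```

Either moving the trained models so that the printed path exists, or running the script from the directory that contains FgSegNet_v2/, should make `load_model` find the file.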

How can I evaluate the segmentation results quantitatively?

Hello, thank you again for sharing your work. You have provided a notebook showing a prediction example, and I would like to know how you evaluate the results quantitatively. Do you extract the resulting segmentation images from the ipynb and then calculate the F-measure and other metrics somewhere else? Thank you.

Training on real video

Thanks for your work! I want to apply this model on real video, but from the code, it seems that if I train on real video, I still need to label the foreground mask. So do I need to label mask on real video?

How can I load model?

Hi, I have finished training and got some *.h5 files. I'm not familiar with Keras, so I used code like this to load the model:
from keras.models import load_model
path = "/path/to/mdl_skating.h5"
model = load_model(path)

But it raises "ValueError: Unknown layer: InstanceNormalization"

I haven't used keras before. So can you give me a demo to load "mdl_skating.h5" to predict "/CDNet2014_dataset/badWeather/skating" video sequence?

how to train my own data?

Thanks a lot for your great work. How can I prepare my own data in the same format as CDnet2014, and how do I train on it?

Evaluation Code

Hello, how do I evaluate the results of the method on the CDnet 2014 dataset? I followed your links and tried to follow every step, but I got this error.

The file C:\dataset\badWeather\blizzard\stats.txt doesn't exist.
It means there was an error calling the comparator.
Traceback (most recent call last):
  File "modified_process_folder.py", line 129, in <module>
    main()
  File "modified_process_folder.py", line 46, in main
    processFolder(datasetPath, binaryRootPath)
  File "modified_process_folder.py", line 62, in processFolder
    confusionMatrix = compareWithGroungtruth(videoPath, binaryPath)  #STATS
  File "modified_process_folder.py", line 84, in compareWithGroungtruth
    return readCMFile(statFilePath)
  File "modified_process_folder.py", line 90, in readCMFile
    raise Exception('error')
Exception: error

Should we recompile the comparator? The original comparator.exe file does exist.