Giter Site home page Giter Site logo

detnas's People

Contributors

yukang2017 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

detnas's Issues

if 'caption' in anns[0]: IndexError: list index out of range

In step 5 (model evaluation) when calling the function evaluete_prediction_coco( ), I get an error. Within this function, a value is assigned to a coco_dt variable by returning the call coco_get.loadRes(coco_result), but at this moment the error if 'caption' in anns [0] is displayed: IndexError: list index out of range

Retinanet search

Hi, thank you for sharing the code. But I can't find the config file of retinanet_search. Could you upload it?

Supernet training and tuning

Thanks for your excellent work!

I have not found how to pretrain supernet in ImageNet and tuning supernet in COCO, here is no supernet training, only the training of the searched network.

Why search cfg.SOLVER.IMS_PER_BATCH = 8 ?

cfg.SOLVER.IMS_PER_BATCH = 8
cfg.SOLVER.MAX_ITER = 88888888
cfg.TEST.IMS_PER_BATCH = ngpus_per_node

when I change it, training is very slow.
And when i start more than one run_server.sh: ERROR RuntimeError: Address already in use.
Thx!!!!

Step5 evaluation result very low

Have you ever had a situation where in Step 5 (model evaluation),all the models searched were evaluated very low, that is, the AP value? I trained the supernet both on my own dataset and COCO dataset, but all searched models had a very low AP during step 5, close to 0. However, when I evaluated the model using the script 'tools/test_net.py', the AP was greater than 0.6.

HELP!many issuea,but i flow your tips

$ bash scripts/run_detnas_coco_fpn_300M_search.sh


Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed.


Traceback (most recent call last):
File "tools/train_net.py", line 19, in
from maskrcnn_benchmark.data import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/init.py", line 2, in
from .build import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/build.py", line 11, in
from . import datasets as D
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/init.py", line 3, in
from .coco import COCODataset
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/coco.py", line 3, in
import torchvision
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/init.py", line 3, in
from torchvision import models
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/init.py", line 12, in
from . import detection
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/init.py", line 1, in
from .faster_rcnn import *
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/faster_rcnn.py", line 13, in
from .rpn import AnchorGenerator, RPNHead, RegionProposalNetwork
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/rpn.py", line 11, in
from . import _utils as det_utils
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py", line 19, in
class BalancedPositiveNegativeSampler(object):
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1219, in script
_compile_and_register_class(obj, _rcb, qualified_name)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1076, in _compile_and_register_class
_jit_script_class_compile(qualified_name, ast, rcb)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/_recursive.py", line 222, in try_compile_fn
return torch.jit.script(fn, _rcb=rcb)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1226, in script
fn = torch._C._jit_script_compile(qualified_name, ast, _rcb, get_default_args(obj))
RuntimeError:
builtin cannot be used as a value:
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:14:56
def zeros_like(tensor, dtype):
# type: (Tensor, int) -> Tensor
return torch.zeros_like(tensor, dtype=dtype, layout=tensor.layout,
~~~~~~~~~~~~~ <--- HERE
device=tensor.device, pin_memory=tensor.is_pinned())
'zeros_like' is being compiled since it was called from 'torch.torchvision.models.detection._utils.BalancedPositiveNegativeSampler.call'
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:72:12

        # randomly select positive and negative examples
        perm1 = torch.randperm(positive.numel(), device=positive.device)[:num_pos]
        perm2 = torch.randperm(negative.numel(), device=negative.device)[:num_neg]

        pos_idx_per_image = positive[perm1]
        neg_idx_per_image = negative[perm2]

        # create binary mask from indices
        pos_idx_per_image_mask = zeros_like(
        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~...  <--- HERE
            matched_idxs_per_image, dtype=torch.uint8
        )
        neg_idx_per_image_mask = zeros_like(
            matched_idxs_per_image, dtype=torch.uint8
        )

        pos_idx_per_image_mask[pos_idx_per_image] = torch.tensor(1, dtype=torch.uint8)
        neg_idx_per_image_mask[neg_idx_per_image] = torch.tensor(1, dtype=torch.uint8)

Traceback (most recent call last):
File "tools/train_net.py", line 19, in
from maskrcnn_benchmark.data import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/init.py", line 2, in
from .build import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/build.py", line 11, in
from . import datasets as D
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/init.py", line 3, in
from .coco import COCODataset
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/coco.py", line 3, in
import torchvision
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/init.py", line 3, in
from torchvision import models
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/init.py", line 12, in
from . import detection
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/init.py", line 1, in
from .faster_rcnn import *
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/faster_rcnn.py", line 13, in
from .rpn import AnchorGenerator, RPNHead, RegionProposalNetwork
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/rpn.py", line 11, in
from . import _utils as det_utils
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py", line 19, in
class BalancedPositiveNegativeSampler(object):
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1219, in script
_compile_and_register_class(obj, _rcb, qualified_name)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1076, in _compile_and_register_class
_jit_script_class_compile(qualified_name, ast, rcb)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/_recursive.py", line 222, in try_compile_fn
return torch.jit.script(fn, _rcb=rcb)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1226, in script
fn = torch._C._jit_script_compile(qualified_name, ast, _rcb, get_default_args(obj))
RuntimeError:
builtin cannot be used as a value:
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:14:56
def zeros_like(tensor, dtype):
# type: (Tensor, int) -> Tensor
return torch.zeros_like(tensor, dtype=dtype, layout=tensor.layout,
~~~~~~~~~~~~~ <--- HERE
device=tensor.device, pin_memory=tensor.is_pinned())
'zeros_like' is being compiled since it was called from 'torch.torchvision.models.detection._utils.BalancedPositiveNegativeSampler.call'
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:72:12

        # randomly select positive and negative examples
        perm1 = torch.randperm(positive.numel(), device=positive.device)[:num_pos]
        perm2 = torch.randperm(negative.numel(), device=negative.device)[:num_neg]

        pos_idx_per_image = positive[perm1]
        neg_idx_per_image = negative[perm2]

        # create binary mask from indices
        pos_idx_per_image_mask = zeros_like(
        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~...  <--- HERE
            matched_idxs_per_image, dtype=torch.uint8
        )
        neg_idx_per_image_mask = zeros_like(
            matched_idxs_per_image, dtype=torch.uint8
        )

        pos_idx_per_image_mask[pos_idx_per_image] = torch.tensor(1, dtype=torch.uint8)
        neg_idx_per_image_mask[neg_idx_per_image] = torch.tensor(1, dtype=torch.uint8)

Traceback (most recent call last):
File "tools/train_net.py", line 19, in

from maskrcnn_benchmark.data import make_data_loader

File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/init.py", line 2, in
from .build import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/build.py", line 11, in
from . import datasets as D
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/init.py", line 3, in
from .coco import COCODataset
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/coco.py", line 3, in
import torchvision
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/init.py", line 3, in
from torchvision import models
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/init.py", line 12, in
from . import detection
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/init.py", line 1, in
from .faster_rcnn import *
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/faster_rcnn.py", line 13, in
from .rpn import AnchorGenerator, RPNHead, RegionProposalNetwork
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/rpn.py", line 11, in
from . import _utils as det_utils
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py", line 19, in
class BalancedPositiveNegativeSampler(object):
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1219, in script
Traceback (most recent call last):
File "tools/train_net.py", line 19, in
from maskrcnn_benchmark.data import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/init.py", line 2, in
from .build import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/build.py", line 11, in
from . import datasets as D
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/init.py", line 3, in
_compile_and_register_class(obj, _rcb, qualified_name)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1076, in _compile_and_register_class
from .coco import COCODataset
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/coco.py", line 3, in
import torchvision
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/init.py", line 3, in
from torchvision import models
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/init.py", line 12, in
from . import detection
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/init.py", line 1, in
from .faster_rcnn import *
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/faster_rcnn.py", line 13, in
_jit_script_class_compile(qualified_name, ast, rcb)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/_recursive.py", line 222, in try_compile_fn
from .rpn import AnchorGenerator, RPNHead, RegionProposalNetwork
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/rpn.py", line 11, in
return torch.jit.script(fn, _rcb=rcb)
from . import _utils as det_utils
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py", line 19, in
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1226, in script
class BalancedPositiveNegativeSampler(object):
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1219, in script
fn = torch._C._jit_script_compile(qualified_name, ast, _rcb, get_default_args(obj))
RuntimeError:
builtin cannot be used as a value:
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:14:56
def zeros_like(tensor, dtype):
# type: (Tensor, int) -> Tensor
return torch.zeros_like(tensor, dtype=dtype, layout=tensor.layout,
~~~~~~~~~~~~~ <--- HERE
device=tensor.device, pin_memory=tensor.is_pinned())
'zeros_like' is being compiled since it was called from 'torch.torchvision.models.detection._utils.BalancedPositiveNegativeSampler.call'
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:72:12

        # randomly select positive and negative examples
        perm1 = torch.randperm(positive.numel(), device=positive.device)[:num_pos]
        perm2 = torch.randperm(negative.numel(), device=negative.device)[:num_neg]

        pos_idx_per_image = positive[perm1]
        neg_idx_per_image = negative[perm2]

        # create binary mask from indices
        pos_idx_per_image_mask = zeros_like(
        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~...  <--- HERE
            matched_idxs_per_image, dtype=torch.uint8
        )
        neg_idx_per_image_mask = zeros_like(
            matched_idxs_per_image, dtype=torch.uint8
        )

        pos_idx_per_image_mask[pos_idx_per_image] = torch.tensor(1, dtype=torch.uint8)
        neg_idx_per_image_mask[neg_idx_per_image] = torch.tensor(1, dtype=torch.uint8)

_compile_and_register_class(obj, _rcb, qualified_name)

File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1076, in _compile_and_register_class
_jit_script_class_compile(qualified_name, ast, rcb)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/_recursive.py", line 222, in try_compile_fn
return torch.jit.script(fn, _rcb=rcb)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1226, in script
fn = torch._C._jit_script_compile(qualified_name, ast, _rcb, get_default_args(obj))
RuntimeError:
builtin cannot be used as a value:
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:14:56
def zeros_like(tensor, dtype):
# type: (Tensor, int) -> Tensor
return torch.zeros_like(tensor, dtype=dtype, layout=tensor.layout,
~~~~~~~~~~~~~ <--- HERE
device=tensor.device, pin_memory=tensor.is_pinned())
'zeros_like' is being compiled since it was called from 'torch.torchvision.models.detection._utils.BalancedPositiveNegativeSampler.call'
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:72:12

        # randomly select positive and negative examples
        perm1 = torch.randperm(positive.numel(), device=positive.device)[:num_pos]
        perm2 = torch.randperm(negative.numel(), device=negative.device)[:num_neg]

        pos_idx_per_image = positive[perm1]
        neg_idx_per_image = negative[perm2]

        # create binary mask from indices
        pos_idx_per_image_mask = zeros_like(
        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~...  <--- HERE
            matched_idxs_per_image, dtype=torch.uint8
        )
        neg_idx_per_image_mask = zeros_like(
            matched_idxs_per_image, dtype=torch.uint8
        )

        pos_idx_per_image_mask[pos_idx_per_image] = torch.tensor(1, dtype=torch.uint8)
        neg_idx_per_image_mask[neg_idx_per_image] = torch.tensor(1, dtype=torch.uint8)

Traceback (most recent call last):
File "tools/train_net.py", line 19, in
from maskrcnn_benchmark.data import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/init.py", line 2, in
from .build import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/build.py", line 11, in
from . import datasets as D
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/init.py", line 3, in
from .coco import COCODataset
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/coco.py", line 3, in
import torchvision
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/init.py", line 3, in
from torchvision import models
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/init.py", line 12, in
from . import detection
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/init.py", line 1, in
from .faster_rcnn import *
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/faster_rcnn.py", line 13, in
from .rpn import AnchorGenerator, RPNHead, RegionProposalNetwork
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/rpn.py", line 11, in
from . import _utils as det_utils
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py", line 19, in
class BalancedPositiveNegativeSampler(object):
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1219, in script
Traceback (most recent call last):
Traceback (most recent call last):
File "tools/train_net.py", line 19, in
_compile_and_register_class(obj, _rcb, qualified_name)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1076, in _compile_and_register_class
Traceback (most recent call last):
File "tools/train_net.py", line 19, in
File "tools/train_net.py", line 19, in
from maskrcnn_benchmark.data import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/init.py", line 2, in
from maskrcnn_benchmark.data import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/init.py", line 2, in
from maskrcnn_benchmark.data import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/init.py", line 2, in
from .build import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/build.py", line 11, in
from .build import make_data_loader
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/build.py", line 11, in
from . import datasets as D
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/init.py", line 3, in
from . import datasets as D
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/init.py", line 3, in
_jit_script_class_compile(qualified_name, ast, rcb)
from .coco import COCODataset
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/coco.py", line 3, in
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/_recursive.py", line 222, in try_compile_fn
from .coco import COCODataset
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/coco.py", line 3, in
from .build import make_data_loader
import torchvision
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/build.py", line 11, in
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/init.py", line 3, in
import torchvision
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/init.py", line 3, in
from torchvision import models
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/init.py", line 12, in
from . import datasets as D
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/init.py", line 3, in
from torchvision import models
from . import detection
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/init.py", line 12, in
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/init.py", line 1, in
from .coco import COCODataset
from .faster_rcnn import *from . import detection

File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/faster_rcnn.py", line 13, in
File "/home/mist/DetNAS-master/maskrcnn_benchmark/data/datasets/coco.py", line 3, in
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/init.py", line 1, in
return torch.jit.script(fn, _rcb=rcb)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1226, in script
from .rpn import AnchorGenerator, RPNHead, RegionProposalNetwork
from .faster_rcnn import * File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/rpn.py", line 11, in

import torchvision  File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/faster_rcnn.py", line 13, in <module>

File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/init.py", line 3, in
from . import _utils as det_utils
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py", line 19, in
from .rpn import AnchorGenerator, RPNHead, RegionProposalNetworkfrom torchvision import models

File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/init.py", line 12, in
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/rpn.py", line 11, in
class BalancedPositiveNegativeSampler(object):
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1219, in script
from . import detectionfrom . import _utils as det_utils

File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/init.py", line 1, in
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py", line 19, in
class BalancedPositiveNegativeSampler(object):
from .faster_rcnn import * File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1219, in script

File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/faster_rcnn.py", line 13, in
from .rpn import AnchorGenerator, RPNHead, RegionProposalNetwork
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/rpn.py", line 11, in
from . import _utils as det_utils
File "/mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py", line 19, in
fn = torch._C._jit_script_compile(qualified_name, ast, _rcb, get_default_args(obj))
RuntimeError_compile_and_register_class(obj, _rcb, qualified_name):

builtin cannot be used as a value:
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:14:56
def zeros_like(tensor, dtype):
# type: (Tensor, int) -> Tensor
return torch.zeros_like(tensor, dtype=dtype, layout=tensor.layout,
~~~~~~~~~~~~~ <--- HERE
device=tensor.device, pin_memory=tensor.is_pinned())
'zeros_like' is being compiled since it was called from 'torch.torchvision.models.detection._utils.BalancedPositiveNegativeSampler.call'
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:72:12

        # randomly select positive and negative examples
        perm1 = torch.randperm(positive.numel(), device=positive.device)[:num_pos]
        perm2 = torch.randperm(negative.numel(), device=negative.device)[:num_neg]

        pos_idx_per_image = positive[perm1]
        neg_idx_per_image = negative[perm2]

        # create binary mask from indices
        pos_idx_per_image_mask = zeros_like(
        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~...  <--- HERE
            matched_idxs_per_image, dtype=torch.uint8
        )
        neg_idx_per_image_mask = zeros_like(
            matched_idxs_per_image, dtype=torch.uint8
        )

        pos_idx_per_image_mask[pos_idx_per_image] = torch.tensor(1, dtype=torch.uint8)
        neg_idx_per_image_mask[neg_idx_per_image] = torch.tensor(1, dtype=torch.uint8)

File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1076, in _compile_and_register_class

class BalancedPositiveNegativeSampler(object):

File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1219, in script
_compile_and_register_class(obj, _rcb, qualified_name)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1076, in _compile_and_register_class
_jit_script_class_compile(qualified_name, ast, rcb)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/_recursive.py", line 222, in try_compile_fn
_compile_and_register_class(obj, _rcb, qualified_name)return torch.jit.script(fn, _rcb=rcb)
_jit_script_class_compile(qualified_name, ast, rcb)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1226, in script
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/_recursive.py", line 222, in try_compile_fn

File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1076, in _compile_and_register_class
return torch.jit.script(fn, _rcb=rcb)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1226, in script
fn = torch._C._jit_script_compile(qualified_name, ast, _rcb, get_default_args(obj))
RuntimeError: _jit_script_class_compile(qualified_name, ast, rcb)
builtin cannot be used as a value:
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:14:56
def zeros_like(tensor, dtype):
# type: (Tensor, int) -> Tensor
return torch.zeros_like(tensor, dtype=dtype, layout=tensor.layout,
~~~~~~~~~~~~~ <--- HERE
device=tensor.device, pin_memory=tensor.is_pinned())
'zeros_like' is being compiled since it was called from 'torch.torchvision.models.detection._utils.BalancedPositiveNegativeSampler.call'
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:72:12

        # randomly select positive and negative examples
        perm1 = torch.randperm(positive.numel(), device=positive.device)[:num_pos]
        perm2 = torch.randperm(negative.numel(), device=negative.device)[:num_neg]

        pos_idx_per_image = positive[perm1]
        neg_idx_per_image = negative[perm2]

        # create binary mask from indices
        pos_idx_per_image_mask = zeros_like(
        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~...  <--- HERE
            matched_idxs_per_image, dtype=torch.uint8
        )
        neg_idx_per_image_mask = zeros_like(
            matched_idxs_per_image, dtype=torch.uint8
        )

        pos_idx_per_image_mask[pos_idx_per_image] = torch.tensor(1, dtype=torch.uint8)
        neg_idx_per_image_mask[neg_idx_per_image] = torch.tensor(1, dtype=torch.uint8)

File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/_recursive.py", line 222, in try_compile_fn
return torch.jit.script(fn, _rcb=rcb)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/jit/init.py", line 1226, in script
fn = torch._C._jit_script_compile(qualified_name, ast, _rcb, get_default_args(obj))
RuntimeError:
builtin cannot be used as a value:
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:14:56
def zeros_like(tensor, dtype):
# type: (Tensor, int) -> Tensor
return torch.zeros_like(tensor, dtype=dtype, layout=tensor.layout,
~~~~~~~~~~~~~ <--- HERE
device=tensor.device, pin_memory=tensor.is_pinned())
'zeros_like' is being compiled since it was called from 'torch.torchvision.models.detection._utils.BalancedPositiveNegativeSampler.call'
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:72:12

        # randomly select positive and negative examples
        perm1 = torch.randperm(positive.numel(), device=positive.device)[:num_pos]
        perm2 = torch.randperm(negative.numel(), device=negative.device)[:num_neg]

        pos_idx_per_image = positive[perm1]
        neg_idx_per_image = negative[perm2]

        # create binary mask from indices
        pos_idx_per_image_mask = zeros_like(
        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~...  <--- HERE
            matched_idxs_per_image, dtype=torch.uint8
        )
        neg_idx_per_image_mask = zeros_like(
            matched_idxs_per_image, dtype=torch.uint8
        )

        pos_idx_per_image_mask[pos_idx_per_image] = torch.tensor(1, dtype=torch.uint8)
        neg_idx_per_image_mask[neg_idx_per_image] = torch.tensor(1, dtype=torch.uint8)

fn = torch._C._jit_script_compile(qualified_name, ast, _rcb, get_default_args(obj))

RuntimeError:
builtin cannot be used as a value:
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:14:56
def zeros_like(tensor, dtype):
# type: (Tensor, int) -> Tensor
return torch.zeros_like(tensor, dtype=dtype, layout=tensor.layout,
~~~~~~~~~~~~~ <--- HERE
device=tensor.device, pin_memory=tensor.is_pinned())
'zeros_like' is being compiled since it was called from 'torch.torchvision.models.detection._utils.BalancedPositiveNegativeSampler.call'
at /mistgpu/miniconda/lib/python3.7/site-packages/torchvision/models/detection/_utils.py:72:12

        # randomly select positive and negative examples
        perm1 = torch.randperm(positive.numel(), device=positive.device)[:num_pos]
        perm2 = torch.randperm(negative.numel(), device=negative.device)[:num_neg]

        pos_idx_per_image = positive[perm1]
        neg_idx_per_image = negative[perm2]

        # create binary mask from indices
        pos_idx_per_image_mask = zeros_like(
        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~...  <--- HERE
            matched_idxs_per_image, dtype=torch.uint8
        )
        neg_idx_per_image_mask = zeros_like(
            matched_idxs_per_image, dtype=torch.uint8
        )

        pos_idx_per_image_mask[pos_idx_per_image] = torch.tensor(1, dtype=torch.uint8)
        neg_idx_per_image_mask[neg_idx_per_image] = torch.tensor(1, dtype=torch.uint8)

Traceback (most recent call last):
File "/mistgpu/miniconda/lib/python3.7/runpy.py", line 193, in _run_module_as_main
"main", mod_spec)
File "/mistgpu/miniconda/lib/python3.7/runpy.py", line 85, in _run_code
exec(code, run_globals)
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/distributed/launch.py", line 253, in
main()
File "/mistgpu/miniconda/lib/python3.7/site-packages/torch/distributed/launch.py", line 249, in main
cmd=cmd)
subprocess.CalledProcessError: Command '['/mistgpu/miniconda/bin/python3', '-u', 'tools/train_net.py', '--local_rank=7', '--config-file', 'configs/e2e_faster_rcnn_DETNAS_COCO_FPN_300M_search.yaml', 'OUTPUT_DIR', 'models/DETNAS_COCO_FPN_300M_1x_search']' returned non-zero exit status 1.

Set grad to None if grad.sum() == 0

Hi, great work!

I am quite new in PyTorch, so please forgive me if this is just some common approach in PyTorch. Could you please educate me on why the grad is set to None when grad.sum() == 0 .

for p in model.parameters():
if p.grad is not None and p.grad.sum() == 0:
p.grad = None

The branches which not have been used in each batch will have 0 gradients, right? Then what's the purpose of setting them to None, and how it will benefit the training?

Thanks

COCO training

Thanks for your excellent work!

How much time this step?
COCO training
bash scripts/run_detnas_coco_fpn_300M_search.sh
('-search' in cfg.MODEL.BACKBONE.CONV_BODY is to distinguish supernet training from single model.)
AND
only after training coco, can train search backbone? Can you offer Supernet coco fine-tuning model?

Pre-train accuracy

How does your accuracy on the training set change during the process of pre-training on ImageNet? I found that my training accuracy improves very slowly when training a supernet defined by myself, Is it normal?

Training model in Google Cloud

I'm interested in your approach, and I would like to train the model in a Google cloud virtual machine.
Can you release a tutorial on how to install/setup the model in Google cloud VM?
Or provide any further resources on how to proceed in that situation?
Thank you for you attention to this situation.

problem about syncbn_gpu

hi, I complete 'bash config.sh'. When I finetune Supernet on coco, I get a problem about syncbn_gpu: "syncbn_gpu.cpython-37m-x86_64-linux-gnu.so: undefined symbol: _ZN3c1011CPUTensorIdEv"
Maybe it is a version problem? My python=3.7 torch=1.3 torchvision=0.4.1, cuda=10.1 cudnn=7.6.5
Looking forward to your help.

I solved that, maybe it is the cpython version problem.

Pre-train time

Hello, when running your pre-train code, it finds iteration once, which takes an average of 1.3 seconds, then 300k iteration is expected to take at least 108 hours (4.5 days), but your paper says 1.5 days, my experimental environment is as follows:
Hardware Configuration:
1, GPU: 8 * Tesla V100-PCIE
2, CPU: 88 * virtual core
Software configuration:
1, pytorch==1.0.1.post2
2, batchsize=1024
3, DataLoader: num_workers=80
4, datasets: Imagenet
Is it that you are using other acceleration strategies?

I met a problem when I installed compile.sh

syncbn_cuda_kernel.cu:12:35: fatal error: ATen/cuda/CUDAContext.h: No such file or directory
compilation terminated.
error: command '/usr/local/cuda/bin/nvcc' failed with exit status 1

bash config.sh

Hello, when I perform bash config.sh appear
nvcc fatal : Unsupported gpu architecture 'compute_75
How can I solve it

installation

Installation
Modify the path to your coco dataset in config.sh.
bash config.sh
According to the above installation instructions, after installation, why does the second step lead to errors?

Pre-train time

Hello, when running your pre-train code, it finds iteration once, which takes an average of 1.3 seconds, then 300k iteration is expected to take at least 108 hours (4.5 days), but your paper says 1.5 days, my experimental environment is as follows:
Hardware Configuration:
1, GPU: 8 * Tesla V100-PCIE
2, CPU: 88 * virtual core
Software configuration:
1, pytorch==1.0.1.post2
2, batchsize=1024
3, DataLoader: num_workers=80
4, datasets: Imagenet
Is it that you are using other acceleration strategies?

win10 compile error

error: command 'C:\Program Files (x86)\Microsoft Visual Studio\2017\Community\VC\Tools\MSVC\14.16.27023\bin\HostX86\x64\link.exe' failed with exit status 1120

PIXEL_MEAN:[0., 0., 0.]?

Hi, thanks for your excellent work!
I've found PIXEL_MEAN:[0., 0., 0.] in DetNAS configuration, which is different from the original maskrcnn-benchmark setting and is somewhat counterintuitive.
Could you please explain it?

demo

could you please send the demo code so that i can run it to test the trained model with visualization? thank U

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.