I solved the problem I mentioned yesterday by setting num_classes to 3 and changing VOC_LABELS to MY_LABELS,
but now I get another error:
ssh://[email protected]:22/home/jiawen/anaconda3/envs/mnist_test/bin/python3.5 -u /home/jiawen/gitcode/ssd.tensorflow/train_ssd.py
2018-07-05 15:28:46.450990: I tensorflow/core/platform/cpu_feature_guard.cc:140] Your CPU supports instructions that this TensorFlow binary was not compiled to use: SSE4.1 SSE4.2 AVX AVX2 FMA
2018-07-05 15:28:46.940678: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:898] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2018-07-05 15:28:46.941827: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1356] Found device 0 with properties:
name: Tesla P100-PCIE-16GB major: 6 minor: 0 memoryClockRate(GHz): 1.3285
pciBusID: 0000:00:08.0
totalMemory: 15.90GiB freeMemory: 15.61GiB
2018-07-05 15:28:46.941862: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1435] Adding visible gpu devices: 0
2018-07-05 15:28:47.291170: I tensorflow/core/common_runtime/gpu/gpu_device.cc:923] Device interconnect StreamExecutor with strength 1 edge matrix:
2018-07-05 15:28:47.291233: I tensorflow/core/common_runtime/gpu/gpu_device.cc:929] 0
2018-07-05 15:28:47.291253: I tensorflow/core/common_runtime/gpu/gpu_device.cc:942] 0: N
2018-07-05 15:28:47.291677: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1053] Created TensorFlow device (/device:GPU:0 with 15135 MB memory) -> physical GPU (device: 0, name: Tesla P100-PCIE-16GB, pci bus id: 0000:00:08.0, compute capability: 6.0)
INFO:tensorflow:Replicating the model_fn
across ['/device:GPU:0']. Variables are going to be placed on ['/device:GPU:0']. Consolidation device is going to be /device:GPU:0.
INFO:tensorflow:Using config: {'_evaluation_master': '', '_model_dir': './logs/', '_num_worker_replicas': 1, '_session_config': gpu_options {
per_process_gpu_memory_fraction: 1.0
}
allow_soft_placement: true
, '_task_id': 0, '_master': '', '_task_type': 'worker', '_global_id_in_cluster': 0, '_cluster_spec': <tensorflow.python.training.server_lib.ClusterSpec object at 0x7f791f7da630>, '_is_chief': True, '_keep_checkpoint_every_n_hours': 10000, '_keep_checkpoint_max': 5, '_save_checkpoints_secs': 7200, '_log_step_count_steps': 10, '_num_ps_replicas': 0, '_tf_random_seed': 20180503, '_service': None, '_train_distribute': None, '_save_summary_steps': 500, '_save_checkpoints_steps': None}
Starting a training cycle.
INFO:tensorflow:Calling model_fn.
WARNING:tensorflow:From /home/jiawen/gitcode/ssd.tensorflow/net/ssd_net.py:114: calling reduce_sum (from tensorflow.python.ops.math_ops) with keep_dims is deprecated and will be removed in a future version.
Instructions for updating:
keep_dims is deprecated, use keepdims instead
/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/ops/gradients_impl.py:100: UserWarning: Converting sparse IndexedSlices to a dense Tensor of unknown shape. This may consume a large amount of memory.
"Converting sparse IndexedSlices to a dense Tensor of unknown shape. "
INFO:tensorflow:Ignoring --checkpoint_path because a checkpoint already exists in ./logs/.
INFO:tensorflow:Done calling model_fn.
INFO:tensorflow:Create CheckpointSaverHook.
INFO:tensorflow:Graph was finalized.
2018-07-05 15:28:52.828754: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1435] Adding visible gpu devices: 0
2018-07-05 15:28:52.828818: I tensorflow/core/common_runtime/gpu/gpu_device.cc:923] Device interconnect StreamExecutor with strength 1 edge matrix:
2018-07-05 15:28:52.828834: I tensorflow/core/common_runtime/gpu/gpu_device.cc:929] 0
2018-07-05 15:28:52.828849: I tensorflow/core/common_runtime/gpu/gpu_device.cc:942] 0: N
2018-07-05 15:28:52.829054: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1053] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 15135 MB memory) -> physical GPU (device: 0, name: Tesla P100-PCIE-16GB, pci bus id: 0000:00:08.0, compute capability: 6.0)
INFO:tensorflow:Restoring parameters from ./logs/model.ckpt-1
Traceback (most recent call last):
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 1322, in _do_call
return fn(*args)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 1307, in _run_fn
options, feed_dict, fetch_list, target_list, run_metadata)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 1409, in _call_tf_sessionrun
run_metadata)
tensorflow.python.framework.errors_impl.InvalidArgumentError: Assign requires shapes of both tensors to match. lhs shape= [9] rhs shape= [63]
[[Node: save/Assign_116 = Assign[T=DT_FLOAT, _class=["loc:@ssd300/multibox_head/cls_5/bias"], use_locking=true, validate_shape=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](ssd300/multibox_head/cls_5/bias/Momentum, save/RestoreV2_1/_1)]]
[[Node: save/RestoreV2_1/_254 = _SendT=DT_FLOAT, client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device_incarnation=1, tensor_name="edge_262_save/RestoreV2_1", _device="/job:localhost/replica:0/task:0/device:CPU:0"]]
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/home/jiawen/gitcode/ssd.tensorflow/train_ssd.py", line 464, in <module>
tf.app.run()
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/platform/app.py", line 126, in run
_sys.exit(main(argv))
File "/home/jiawen/gitcode/ssd.tensorflow/train_ssd.py", line 460, in main
hooks=[logging_hook], max_steps=FLAGS.max_number_of_steps)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/estimator/estimator.py", line 363, in train
loss = self._train_model(input_fn, hooks, saving_listeners)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/estimator/estimator.py", line 843, in _train_model
return self._train_model_default(input_fn, hooks, saving_listeners)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/estimator/estimator.py", line 859, in _train_model_default
saving_listeners)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/estimator/estimator.py", line 1056, in _train_with_estimator_spec
log_step_count_steps=self._config.log_step_count_steps) as mon_sess:
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 405, in MonitoredTrainingSession
stop_grace_period_secs=stop_grace_period_secs)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 816, in __init__
stop_grace_period_secs=stop_grace_period_secs)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 539, in __init__
self._sess = _RecoverableSession(self._coordinated_creator)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 1002, in __init__
_WrappedSession.__init__(self, self._create_session())
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 1007, in _create_session
return self._sess_creator.create_session()
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 696, in create_session
self.tf_sess = self._session_creator.create_session()
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 467, in create_session
init_fn=self._scaffold.init_fn)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/session_manager.py", line 279, in prepare_session
config=config)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/session_manager.py", line 207, in _restore_checkpoint
saver.restore(sess, ckpt.model_checkpoint_path)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/saver.py", line 1802, in restore
{self.saver_def.filename_tensor_name: save_path})
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 900, in run
run_metadata_ptr)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 1135, in _run
feed_dict_tensor, options, run_metadata)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 1316, in _do_run
run_metadata)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/client/session.py", line 1335, in _do_call
raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.InvalidArgumentError: Assign requires shapes of both tensors to match. lhs shape= [9] rhs shape= [63]
[[Node: save/Assign_116 = Assign[T=DT_FLOAT, _class=["loc:@ssd300/multibox_head/cls_5/bias"], use_locking=true, validate_shape=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](ssd300/multibox_head/cls_5/bias/Momentum, save/RestoreV2_1/_1)]]
[[Node: save/RestoreV2_1/_254 = _SendT=DT_FLOAT, client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device_incarnation=1, tensor_name="edge_262_save/RestoreV2_1", _device="/job:localhost/replica:0/task:0/device:CPU:0"]]
Caused by op 'save/Assign_116', defined at:
File "/home/jiawen/gitcode/ssd.tensorflow/train_ssd.py", line 464, in <module>
tf.app.run()
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/platform/app.py", line 126, in run
_sys.exit(main(argv))
File "/home/jiawen/gitcode/ssd.tensorflow/train_ssd.py", line 460, in main
hooks=[logging_hook], max_steps=FLAGS.max_number_of_steps)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/estimator/estimator.py", line 363, in train
loss = self._train_model(input_fn, hooks, saving_listeners)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/estimator/estimator.py", line 843, in _train_model
return self._train_model_default(input_fn, hooks, saving_listeners)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/estimator/estimator.py", line 859, in _train_model_default
saving_listeners)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/estimator/estimator.py", line 1056, in _train_with_estimator_spec
log_step_count_steps=self._config.log_step_count_steps) as mon_sess:
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 405, in MonitoredTrainingSession
stop_grace_period_secs=stop_grace_period_secs)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 816, in __init__
stop_grace_period_secs=stop_grace_period_secs)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 539, in __init__
self._sess = _RecoverableSession(self._coordinated_creator)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 1002, in __init__
_WrappedSession.__init__(self, self._create_session())
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 1007, in _create_session
return self._sess_creator.create_session()
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 696, in create_session
self.tf_sess = self._session_creator.create_session()
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 458, in create_session
self._scaffold.finalize()
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/monitored_session.py", line 214, in finalize
self._saver.build()
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/saver.py", line 1347, in build
self._build(self._filename, build_save=True, build_restore=True)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/saver.py", line 1384, in _build
build_save=build_save, build_restore=build_restore)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/saver.py", line 829, in _build_internal
restore_sequentially, reshape)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/saver.py", line 525, in _AddShardedRestoreOps
name="restore_shard"))
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/saver.py", line 494, in _AddRestoreOps
assign_ops.append(saveable.restore(saveable_tensors, shapes))
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/training/saver.py", line 185, in restore
self.op.get_shape().is_fully_defined())
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/ops/state_ops.py", line 283, in assign
validate_shape=validate_shape)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/ops/gen_state_ops.py", line 60, in assign
use_locking=use_locking, name=name)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/framework/op_def_library.py", line 787, in _apply_op_helper
op_def=op_def)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/framework/ops.py", line 3392, in create_op
op_def=op_def)
File "/home/jiawen/anaconda3/envs/mnist_test/lib/python3.5/site-packages/tensorflow/python/framework/ops.py", line 1718, in __init__
self._traceback = self._graph._extract_stack() # pylint: disable=protected-access
InvalidArgumentError (see above for traceback): Assign requires shapes of both tensors to match. lhs shape= [9] rhs shape= [63]
[[Node: save/Assign_116 = Assign[T=DT_FLOAT, _class=["loc:@ssd300/multibox_head/cls_5/bias"], use_locking=true, validate_shape=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](ssd300/multibox_head/cls_5/bias/Momentum, save/RestoreV2_1/_1)]]
[[Node: save/RestoreV2_1/_254 = _SendT=DT_FLOAT, client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device_incarnation=1, tensor_name="edge_262_save/RestoreV2_1", _device="/job:localhost/replica:0/task:0/device:CPU:0"]]
Process finished with exit code 1
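A rough sanity check on the two shapes in the error, as a guess rather than something confirmed against the repo. It assumes the class-prediction bias has anchors_per_location * num_classes entries (the usual SSD layout) and that the checkpoint restored from ./logs/ was written while num_classes was still the 21 VOC classes (20 objects plus background). The helper name and constants below are only for illustration:

```python
# Hypothetical helper: size of an SSD class-prediction bias, assuming the
# head predicts num_classes scores for every anchor at each location.
def cls_head_channels(anchors_per_location, num_classes):
    return anchors_per_location * num_classes

VOC_NUM_CLASSES = 21       # 20 VOC object classes + background
MY_NUM_CLASSES = 3         # the value now set in the training script

checkpoint_bias_size = 63  # "rhs shape" in the error: value read from the checkpoint
graph_bias_size = 9        # "lhs shape" in the error: variable in the new graph

# Both sizes imply the same number of anchors per location if the assumption holds.
anchors_from_checkpoint = checkpoint_bias_size // VOC_NUM_CLASSES
anchors_from_graph = graph_bias_size // MY_NUM_CLASSES

print(anchors_from_checkpoint, anchors_from_graph)
print(cls_head_channels(anchors_from_graph, VOC_NUM_CLASSES) == checkpoint_bias_size)
```

If that holds, the old ./logs/model.ckpt-1 (which the log says is restored instead of --checkpoint_path) was probably saved by the 21-class model and no longer matches the 3-class graph.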
Can you list all the lines that need to be changed if I want to train on my own data?