junqiangchen / luna16-lung-nodule-analysis-2016-challenge Goto Github PK

LUNA16-Lung-Nodule-Analysis-2016-Challenge

Home Page: https://luna16.grand-challenge.org/

Python 92.27% Jupyter Notebook 7.73%

lung-cancer-detection vnet resnet classification detection tensroflow python lidc-dataset luna16

luna16-lung-nodule-analysis-2016-challenge's Introduction

LUNA16-LUng-Nodule-Analysis-2016-Challenge

This is an example of the CT images lung nodule detection and false positive reduction from LUNA16-LUng-Nodule-Analysis-2016-Challenge

Prerequisities

The following dependencies are needed:

numpy >= 1.11.1
SimpleITK >=1.0.1
opencv-python >=3.3.0
tensorflow-gpu ==1.8.0
pandas >=0.20.1
scikit-learn >= 0.17.1

How to Use

1、Preprocess

nodule detection

convert annotation.csv file to image mask file:run the LUNA_mask_extraction.py
analyze the ct image,and get the slice thickness and window width and position:run the dataAnaly.py
generate lung nodule ct image and mask:run the data2dprepare.py
generate patch(96,96,16) lung nodule image and mask:run the data3dprepare.py
save lung nodule data and mask into csv file run the utils.py,like this:G:\Data\segmentation\Image/0_161....

nodule classify

convert candidates.csv file to nodule and not-nodule image(48,48,48):run the LUNA_node_extraction.py
Augment the nodule image data: run the Augmain.py
split data into train data(80%) and test data(20%):run the subset.py
save lung nodule data and label into csv file like this:1,G:\Data\classify\1_aug/0_17.npy

2、Nodule Detection

the VNet model

train and predict in the script of vnet3d_train.py and vnet3d_predict.py

3、False Positive Reducution

the ResVGGNet model

train and predict in the script of ResNet3d_train.py and ResNet3d_predict.py

4、download trained model

i have shared the trained model of nodule detection and false positive reduction on here: https://pan.baidu.com/s/1I7zhzmPsTCbz0ZeIntNrUA ,password:orpm

Result

1、Nodule Detection

train loss and train accuracy

the segment result

2、False Positive Reducution

train loss and train accuracy

ROC,Confusion Matrix and Metrics

Contact

https://github.com/junqiangchen
email: [email protected]
Contact: junqiangChen
WeChat Number: 1207173174
WeChat Public number: 最新医学影像技术

luna16-lung-nodule-analysis-2016-challenge's People

Contributors

Stargazers

Watchers

Forkers

kant dyerlee hs99 qujunda minghaobao alonelover cuongnc220592 minshenhao1 neo-cc hengliang8 abv19 liuwenhaha lovedoubledan fariasfc cloudpanl haoshujian wxlearncoding zibagandomkar bubusang yanzhenms soonhwan-kwon niepengfeiisgood aravind-812 rohithakakarla yezhounan syd951186545 anwenzhiyi ankushkamble jackyjlu qianlingjun wuyanrui1996 ashishpatel26 ppvastar gsymine zhubin1 qiujingtao huixugh yty0801 rupali0311 daojin505 makama-md zhudongwork chengmuni66 sipl-zrc qiulimoges houyuejie luan-zb xxx2009-lab ru0628 continue12 further2006 huhongming86 mozg90 perhapzz shrheh20 chen0jie bo666666 hhdqj oceanechy gym200012 ssraghuvanshi 1320068008 ljm198134 iiitmg niexiuping ngoc-mis yanruyang sepidhk techsoft29 mymyth0211 anuragtimilsina hczyni jakaria08 geeteshch wwwshine1995 retire2053 lin960413 sangwan16 bosssir tombling gnvvs-07 franzkingstein alisaqibh1

luna16-lung-nodule-analysis-2016-challenge's Issues

Prediction result

I have trained your models. The results are good by your way.
But when I tested on dataset without segmentation techniques. The results for nodule class(label = 1) is very poor.
Do you think about it ? the test set should not be augmented.

I also want to download your trained models, but it is impossible with foreigners.
Why you do not upload on github ??

Thank you very much

Citation missing!

Hello, it would be helpful if the paper presenting the work in this repo be cited here for reference.

Any elaboration would be much appreciated :) thanks in advance.

怎么通过分割网络的输出的3D PATCH得到mask 的坐标

How to test a complete 3D image?

I want to know how can we test a complete 3D image? If I have a 3D image (512512300), how to generate the nodule? Is there a such test code?
In addition, how to generate the confusion matrix and metrics?

about the dataprcessing

陈老师，你好
我运行数据预处理部分的LUNA_node_extraction.py时，本来应该是生成classification文件夹的，但是我运行后没有结果。调试了一下代码发现line 93处的 if mini_df.shape[0] > 0: 这个判断都没有进去。尝试在此之前添加了一句print（mini_df.shape[0]），发现全部都是零。
多谢老师指教

Coordinate Transformation Problem

In the ‘candidates.csv’, why is it that the nodule coordinates minus the origin coordinates get negative?

Training problems

如何为解决layer.py文件中“from ResNet3d.layer import (conv3d, normalizationlayer, max_pool3d, resnet_Add, weight_xavier_init, bias_variable, dense_to_one_hot)报错的问题”

ValueError: cannot reshape array of size 16 into shape (16,96,96)

您好，在使用您提供的模型做segmeation预测时，产生了ValueError: cannot reshape array of size 16 into shape (16,96,96)问题，想请教您如何解决

良恶性分类

您好，感谢您的代码！但在运行分类代码的过程中遇到了无法生成分类文件0和1的问题，请问怎么解决？万分感激。

dataAnaly.py

请问现在还提供数据集开放下载么

data3prepare.py

您好：
在data3prepare.py中，如果不想切出太多的sub_image，可以把numberxy=10, numberz=6
改成numberxy=3, numberz=20吗？

false

请问第四步中segmentation文件夹是如何获取的

您好，在第四步遇到了一些疑惑想向您请教。1-3步顺利得到process文件夹，4看了您分享的已完成的csv文件内的路径，但我不知道如何获取segmentation文件夹，谢谢，祝好。

How to run Step 4

Hello, when you run your program, you encounter some doubts. When I finished running Parts 1-3, I encountered difficulties in running Part 4. I don't know how to save the generated mask to the CSV file. Do you need to write your own code, or which Python file do you want to run? I hope to get your help.
Thank you.

结节假阳性二分类数据划分

请问在肺结节假阳性二分类的时候，代码结果是采用的了全部的结节数据集吗？因为我看到这个任务的正样本有1300+而负样本有5'000'000+

How to run inference on an external .mhd file? (which scripts and in which order)

GPU

Hello !
On what GPU did you run the training?
Thanks !

预测结果拼接

请问在得到预测结果后，可以如何根据步长等参数对预测图片拼接成512*512的图片？
谢谢！ @junqiangchen

subset.py的一些疑惑？

您好！正在复现您项目的分类部分，遇到了点问题，不是很理解subset.py文件的运行机理，
具体是里面的sys.argv[1]所对应的文件是什么？望解惑！感谢

假阳性减少

您好，我之前用luna16数据集已经进行了肺结节的检测，检测结果是假阳性高，想用您的假阳性减少这一模块来去假阳。我看到ResNet3d_train.py这代码里面涉及到nodel_all_train.csv这一文件，但是我没有找到这一文件。请问是不能直接用去假阳这一模块吗？还是有什么解决办法呢？谢谢，祝好！

执行了3、4天的epoch是5吗？

抱歉，又来打扰了，我可能对工程做了点修改，根据现在的情况，感觉单卡训练5个epoch要3、4天；

因为5个epoch确实比着其他工程确实太少了，不太确定是不是我的修改有问题，下图（工程里的loss曲线时间是4天）的时间就是执行了5个epoch的时间吗？

请问用作结节分割的mask的生成中是否是正负样本均有的

我运行完程序，把您生成的bmp文件改为生成png图片，查看生成的结节mask，大约每个文件夹有一半是纯黑色图片，是有些是负样本吗

良恶 trainloss不下降

你好，我按照你的代码执行良恶数据处理后训练后loss一直在0.5-0.6几震荡，增加了epoch和BN层以及去除数据增强都没有作用，请问你在LUNA2022中得到的训练结果图是执行的这个代码吗？

关于dice loss

您好，我复现了您的程序，觉得很有收获，只是有一个疑问就是您的loss = -dice，我觉得这里应该是1-dice。祝好。

数据集问题

您好，请问您的原始数据是怎么存放的，可以给出您数据的目录吗？

Mis-matching Total number of candidates

Hello,

after evaluating the predictions file (i.e. METU_VISION_FPRED.csv) using the LUNA16 official script (i.e. noduleCADEvaluationLUNA16.py), I get an output (CADAnalysis.txt) like this :

CADAnalysis.txt

CAD Analysis: METU_VISION_FPRED

Candidate detection results:
True positives: 1128
False positives: 81706
False negatives: 58
True negatives: 0
Total number of candidates: 87794
Total number of nodules: 1186
Ignored candidates on excluded nodules: 4600
Ignored candidates which were double detections on a nodule: 360
Sensitivity: 0.951096121
Average number of candidates per scan: 98.867117117

From what I understand, the the Total number of candidates should be 754,975 according to https://luna16.grand-challenge.org/Evaluation/

Even though METU_VISION_FPRED.csv has 754,975 candidates!! the CADAnalysis.txt still says Total number of candidates: 87794 !!!!!

Looking forward to hearing back from you. Any help is extremely appreciated.

Many thanks.

Any idea why that happens?? Am I missing something?

测试结果的计算

很抱歉，再次打扰一下，想问一下工程里有关于测试结果统计的脚本吗？比如工程中的这个

不理解怎么处理？

Link to download trained model not working

Hi @junqiangchen
Can you fix the link to download the trained model? Thank you so much.

dataprocess问题

请问有没有pytorch版本的代码，感谢

请问有没有pytorch版本的代码，十分感谢！！

良恶性分类

您好，我阅读了您的文章并尝试了分类相关的程序，您提到的良恶性分类，然后实际上 0 代表非结节， 1 代表结节，没有良恶性相关的区分，我有点困惑您提到的良恶性分类是如何做到的

Ensembling both models for inference

Hello,

Thanks for sharing your work. Are the 2 models meant to ensemble, chain or otherwise combine in some way? I am wondering how you use the Resnet to reduce FP in the Vnet mask output. Thanks!

Augmain.py file is not executing

Dear Sir, your work is helped me a lot in that I'm facing following problem,
please suggest how to run Augmain.py in spyder. when I run it showing following error:

File "D:\mywork\LUNA16 Master\Augmain.py", line 1, in
from dataaugmentation.Augmentation.ImageAugmentation import DataAug3D

ModuleNotFoundError: No module named 'dataaugmentation'

So, please provide the solution for above problem.

thank for advance

运行的几点疑惑

你好，看过工程的readme，有几点疑惑希望能够得到解答；

1、中看过处理的9个步骤，我的理解是1～4重新生成了一次图片的掩膜文件和mhd中的坐标、和尺寸信息，根据luna官方给的数据是不是5～9才是必要的？

2、工程需要的tensorflow是那个版本，仅用GPU可以跑吗？

我想将其转为pytorch来实现，如果你感觉有什么建议或者要注意的，希望可以指教一下，拜谢！

data3dprepare.py

师兄您好，请问您运行data3dprepare.py共花了多长时间呀？我按您的代码，运行3天了，最后崩在了第154-179个生成文件上面~~是不是我是不是哪里出了问题呀？暂时没改变您在data3dprepare.py里面写的参数~~

您好，请问一下ResNet3d_predict.py中的nodel_all_test.csv是哪个文件生成的？

格式是哪样的？

关于模型检测的问题

我发现，如果我用全黑或者全白的图片放入您的V-net中进行检测，结果却显示有结节，这显然是不科学的，但我并不知道有什么问题，不知道您有尝试过吗？

Issues with subset.py

issue with Vnet Train

Dear Sir thanks for your post it helped a lot. But I'm getting following issue when executing Vnet Training:

from Vnet.model_vnet3d import Vnet3dModule

ModuleNotFoundError: No module named 'Vnet'

I'm using Anacond in Windows.

So please suggest the solution for above issue

module 'tensorflow' has no attribute 'placeholder'

When I am using this method:
Vnet3d = Vnet3dModule(128, 128, 16, channels=1, costname=("dice coefficient",), inference=True,
model_path="/content/drive/MyDrive/model/resnet.pd-50000.data-00000-of-00001")

I got this error:

in init(self, image_height, image_width, image_depth, channels, costname, inference, model_path)
195 self.channels = channels
196
--> 197 self.X = tf.placeholder("float", shape=[None, self.image_depth, self.image_height, self.image_width,
198 self.channels])
199 self.Y_gt = tf.placeholder("float", shape=[None, self.image_depth, self.image_height, self.image_width,

AttributeError: module 'tensorflow' has no attribute 'placeholder'

更新后data3dprepare.py异常

师兄您好，这几天我运行了一下您的程序，发现您更新了data3dprepare.py，但运行到
hr_samples[0, 0:blockz, 0:block_width, 0:block_height] = image[0:rangz, 0:rangwidth, 0:rangheight]
的时候报错，
TypeError: slice indices must be integers or None or have an index method
然后我发现rangz rangwidth rangheight 在前几行的计算过程得出的貌似不是整数型，
rangz = lambda imagez, blockz: imagez if imagez < blockz else blockz
rangwidth = lambda width, block_width: width if width < block_width else block_width
rangheight = lambda height, block_height: height if width < block_height else block_height
您看这里是不是需要int强制转换一下？？？