C code exported by porter has wrong data type for feature value as double which will c

attaching zip file contains C program trained for 10000 record

Decision tree C code exported by porter has wrong datatype for features array it should be float about sklearn-porter HOT 4 OPEN

nok commented on May 31, 2024

Decision tree C code exported by porter has wrong datatype for features array it should be float

from sklearn-porter.

Comments (4)

nok commented on May 31, 2024

Can you please provide some data and code for comparison?

(There is a bigger difference between the internal and textual representation of values in Python I guess.)

from sklearn-porter.

vijaykilledar commented on May 31, 2024

ok I will provide detail example/data tomorrow.

from sklearn-porter.

vijaykilledar commented on May 31, 2024

attaching zip file contains

C program trained for 10000 records with accepting feature float data type
C program trained for 10000 records with accepting feature double data type
Shell script used to calculate the matched records of target binary of above programs
Test data set file
Expected prediction data file
porter_attachments.zip
csv file used for training (First column as Target class, and rest of the column as test data set)
train_10000.zip

test script output at my end

./test_prediction.sh ./train_10000 ./train_10000_target ./porter_train_10000_double 
test data file - test_data/train_10000
expected prediction data file - test_data/train_10000_target
testing output binray by feeding training data .......
Total records - 10000
Matched prediction records - 9878

./test_prediction.sh ./train_10000 ./train_10000_target ./porter_train_10000 _float
test data file - test_data/train_10000
expected prediction data file - test_data/train_10000_target
testing output binray by feeding training data .......
Total records - 10000
Matched prediction records - 9992

from sklearn-porter.

nok commented on May 31, 2024

Okay, thanks. Can you please validate the data type of your training data?

print(type(X[0]))  # <type 'numpy.float32'> or <type 'numpy.float64'>

For load_digits it's numpy.float64 which is double in C. The integrity check finished without mismatches. So I changed the data to floats with X.astype(np.float32) and finished the integrity check again without errors.

Nevertheless it depends on the data. In general I see the problem of point precisions between data types and programming languages. It could make sense to add a possibility to change the features data type in transpiled output by using a new argument temp_dtype='float'.

Further atof() converts a string to double in C. On the other hand if you want to use floats, you should use strtof() to convert strings to float.

Can you test it?

from sklearn-porter.

Recommend Projects

Decision tree C code exported by porter has wrong datatype for features array it should be float about sklearn-porter HOT 4 OPEN

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent