tabular_synthesizer's Introduction

Model Comparison and Evaluation

This section compares two models for synthesizing tabular data, a GAN and a VAE, each in a sigmoid-activation and a softmax-activation variant. The comparison uses one statistical measure, Kullback-Leibler (K.L) divergence, and one similarity measure, cosine similarity.

1. Statistical Comparison - K.L Divergence

1.1 GAN with Sigmoid Activation vs. GAN with Softmax Activation

  • File: tickets-gans.ipynb, Sections 1 and 2

K.L Divergence Results:

  • Describe the process and results of comparing the synthesized data from GAN with Sigmoid Activation and GAN with Softmax Activation using K.L divergence.
  • Interpret the K.L divergence values and discuss the statistical significance.
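
As a point of reference for the comparison described above, here is a minimal sketch of a K.L divergence estimate for a single categorical column. It is not taken from tickets-gans.ipynb: the function name `kl_divergence`, the `eps` smoothing constant, and the sample values are all assumptions made for illustration.

```python
import math
from collections import Counter


def kl_divergence(real_column, synth_column, eps=1e-9):
    """Estimate KL(real || synth) over the categories of one column."""
    # Use the union of categories so both distributions share the same support.
    categories = sorted(set(real_column) | set(synth_column))
    real_counts = Counter(real_column)
    synth_counts = Counter(synth_column)
    # eps (an assumed smoothing constant) avoids log(0) and division by zero
    # when a category is missing from one of the two datasets.
    p = [real_counts[c] + eps for c in categories]
    q = [synth_counts[c] + eps for c in categories]
    p_total, q_total = sum(p), sum(q)
    return sum(
        (pi / p_total) * math.log((pi / p_total) / (qi / q_total))
        for pi, qi in zip(p, q)
    )


# Hypothetical column values, for illustration only.
real = ["low", "low", "medium", "high", "medium", "low"]
synth_a = ["low", "medium", "low", "high", "medium", "low"]  # same frequencies as real
synth_b = ["high", "high", "high", "medium", "high", "low"]  # skewed toward "high"

kl_a = kl_divergence(real, synth_a)
kl_b = kl_divergence(real, synth_b)
print(kl_a, kl_b)
```

A value near zero means the synthesized column reproduces the category frequencies of the original, and larger values mean a larger mismatch. K.L divergence is not symmetric, so the direction (real versus synthesized) should be stated alongside any reported value. In a notebook, `scipy.stats.entropy(p, q)` computes the same quantity from two probability vectors.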

1.2 VAE with Sigmoid Activation vs. VAE with Softmax Activation

  • File: tickets-gans.ipynb, Sections 3 and 4

K.L Divergence Results:

  • Describe the process and results of comparing the synthesized data from VAE with Sigmoid Activation and VAE with Softmax Activation using K.L divergence.
  • Interpret the K.L divergence values and discuss the statistical significance.

2. Similarity Measures - Cosine Similarity

2.1 GAN with Sigmoid Activation vs. GAN with Softmax Activation

  • File: tickets-gans.ipynb, Sections 1 and 2

Cosine Similarity Results:

  • Describe the process and results of comparing the synthesized data from GAN with Sigmoid Activation and GAN with Softmax Activation using cosine similarity.
  • Discuss the implications of the cosine similarity values on the similarity between the datasets.
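
For orientation, the following is a minimal sketch of one way to compute a cosine similarity between two numeric tables, here by comparing their column-mean vectors. Comparing column means is an assumption made for this example, not necessarily what the notebook does, and the helper names `cosine_similarity` and `column_means` and the sample rows are hypothetical.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def column_means(rows):
    """Summarize a table (list of numeric rows) as one mean value per column."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


# Hypothetical numeric rows, for illustration only.
real_rows = [[1.0, 20.0, 0.0], [2.0, 22.0, 1.0], [3.0, 18.0, 1.0]]
synth_rows = [[1.5, 19.0, 0.0], [2.5, 21.0, 1.0], [2.6, 20.0, 0.0]]

score = cosine_similarity(column_means(real_rows), column_means(synth_rows))
print(score)
```

Cosine similarity measures direction only, so a synthesized table whose values are uniformly scaled copies of the original would still score 1. For that reason it is best interpreted together with a distribution-level measure rather than on its own.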

2.2 VAE with Sigmoid Activation vs. VAE with Softmax Activation

  • File: tickets-gans.ipynb, Sections 3 and 4

Cosine Similarity Results:

  • Describe the process and results of comparing the synthesized data from VAE with Sigmoid Activation and VAE with Softmax Activation using cosine similarity.
  • Discuss the implications of the cosine similarity values on the similarity between the datasets.

Conclusion

  • Summarize the findings from both statistical comparison and similarity measures.
  • Provide insights into the strengths and limitations of each model.
  • Discuss potential areas for improvement or further exploration.

Together, these comparisons show how closely the synthesized data aligns with the original data and how the models differ in achieving that alignment.

Models link: https://drive.google.com/drive/u/2/folders/1eEw2LhuK3aMJziK-6VmmvY3kVS-44vxg

tabular_synthesizer's People

Contributors

ahmedhassan187

tabular_synthesizer's Issues

VAE with sigmoid

Feature

As a data scientist, I want to predict all features with my VAE model and reconstruct them in the original shape of the data.

K.L divergence

Feature

As a data scientist, I want to compare the four models and their approaches, and measure the performance of each one.

GAN softmax

Feature

As a data scientist, I want to predict all features with my GAN model and reconstruct them in the original shape of the data.
