Giter Site home page Giter Site logo

Comments (5)

Vivek-Merugu avatar Vivek-Merugu commented on August 15, 2024 1

The problem happened because there wasn't enough memory allocated to the Spark executor. After increasing the Spark executor memory, the issue was resolved.

Hence, closing this issue.

from abris.

cerveada avatar cerveada commented on August 15, 2024

What makes you believe that this issue is caused by Abris?

from abris.

Vivek-Merugu avatar Vivek-Merugu commented on August 15, 2024

Thank you for your response. I appreciate your assistance in addressing this issue. Here's a breakdown of the situation:

Old Build Overview:

  • The initial Spark Streaming Scala code processed JSON messages from Kafka Topic A.
  • Operations were performed on these messages, and the results were published to Kafka Topic B.
  • Successfully deployed on an Amazon EMR cluster, running smoothly.

New Build Overview:

  • Transitioned to receiving Confluent AVRO messages from Kafka Topic A in the same application.
  • Utilized ABRiS library functions (from_avro() and to_avro()) to deserialize AVRO to JSON and vice versa.
  • The rest of the code remained unchanged.
  • Successfully deployed on an Amazon EMR cluster, initially working fine for the first 20 to 30 minutes.

Error in New Build:

After the initial working period, the streaming application encountered the following error:

ERROR Client: Application diagnostics message: Application application_1696171462852_0173 failed 2 times due to AM Container for appattempt_1696171462852_0173_000002 exited with exitCode: 137
Failing this attempt.Diagnostics: Container killed on request. Exit code is 137
Container exited with a non-zero exit code 137.
Killed by external signal

Investigation Steps:

  • Referenced the AWS Knowledge Center article here for troubleshooting.
  • Despite following the suggested steps, the issue persists.

Specific Queries:

  • Seeking your input on the possibility of ABRiS library functions (from_avro() and to_avro()) causing the issue.
  • Additionally, exploring other potential factors, such as message payload size, that might be contributing to the problem.

Your insights and suggestions on resolving this matter would be greatly appreciated.

from abris.

cerveada avatar cerveada commented on August 15, 2024

Abris doesn't really works with json, it converts from avro to spark DataFrame. So it could also be some issue with json.

Best would be to use some profile to see what is really happening in the memory.

One Idea I have is the registryConfig. Do your application uses avro with many different registryConfig maps? That could cause a memory leak since the registry clients are cached for each config.

from abris.

Vivek-Merugu avatar Vivek-Merugu commented on August 15, 2024

ABRiS Library Usage:

  • Yes, I am utilizing the ABRiS library function from_avro() to convert AVRO to Spark DataFrame and then transforming the Spark DataFrame to JSON messages and vice-versa.

RegistryConfig in Use:

  • The application employs three distinct registryConfig maps.

Memory Monitoring:

  • Certainly, I will implement memory monitoring and keep you updated.

from abris.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.