dataspherestudio-doc's Introduction

DataSphereStudio

English | 中文

Introduction

       DataSphere Studio (DSS for short) is a one-stop data application development and management framework developed by WeBank.

       Under a unified UI, DataSphere Studio provides a workflow-based, graphical drag-and-drop development experience that covers the full data application development process: data exchange, desensitization and cleaning, analysis and mining, quality inspection, visualization, scheduled execution, and data output.

       **DSS is designed with a pluggable integration framework, allowing users to easily and quickly replace the functional components that DSS has integrated, or add new functional components according to their needs.**

Integrated application tools

       DSS has integrated a variety of upper-layer data application systems by implementing multiple AppConns, which cover most users' data development needs.

       If desired, new data application systems can also be easily integrated to replace or enrich DSS's data application development process. See the guide on how to quickly integrate new application systems.

| Utility | Description | DSS0.X compatible version (DSS0.9.1 recommended) | DSS1.0 compatible version (DSS1.1.0 recommended) |
| --- | --- | --- | --- |
| Linkis | Computing middleware Apache Linkis. By providing standard interfaces such as REST/WebSocket/JDBC/SDK, it lets upper-layer applications easily connect to and access underlying engines such as MySQL/Spark/Hive/Presto/Flink. | Linkis0.11.0 recommended (Released) | >= Linkis1.1.1 (Released) |
| DataApiService | (Third-party application tool built into DSS) Data API service. SQL scripts can be quickly published as RESTful interfaces, providing REST access to external callers. | Not supported | DSS1.1.0 recommended (Released) |
| Scriptis | (Third-party application tool built into DSS) A web tool for data analysis that supports writing SQL, PySpark, HiveQL, and other scripts online and submitting them to [Linkis](https://github.com/WeBankFinTech/Linkis). | DSS0.9.1 recommended (Released) | DSS1.1.0 recommended (Released) |
| Schedulis | Workflow task scheduling system developed on top of Azkaban, with financial-grade features such as high performance, high availability, and multi-tenant resource isolation. | Schedulis0.6.1 recommended (Released) | >= Schedulis0.7.0 (Released) |
| EventCheck | (Third-party application tool built into DSS) Provides signal communication capabilities across businesses, projects, and workflows. | DSS0.9.1 recommended (Released) | DSS1.1.0 recommended (Released) |
| SendEmail | (Third-party application tool built into DSS) Provides the ability to send data; the result sets of other workflow nodes can all be sent by email. | DSS0.9.1 recommended (Released) | DSS1.1.0 recommended (Released) |
| Qualitis | Data quality verification tool, providing verification capabilities such as data integrity and correctness checks. | Qualitis0.8.0 recommended (Released) | >= Qualitis0.9.2 (Released) |
| Streamis | Streaming application development and management tool. It supports publishing Flink Jar and Flink SQL, and provides development, debugging, and production management capabilities for streaming applications, such as start/stop, status monitoring, and checkpoints. | Not supported | >= Streamis0.2.0 (Released) |
| Prophecis | A one-stop machine learning platform that integrates multiple open source machine learning frameworks. Prophecis' MLFlow can be connected to DSS workflows through AppConn. | Not supported | >= Prophecis 0.3.2 (Released) |
| Exchangis | A data exchange platform that supports data transmission between structured and unstructured heterogeneous data sources. The upcoming Exchangis1.0 will work with DSS workflows. | Not supported | = Exchangis1.0.0 (Released) |
| Visualis | A data visualization BI tool developed on top of Davinci, an open source project of CreditEase. It provides users with financial-grade data visualization capabilities in terms of data security. | Visualis0.5.0 recommended | = Visualis1.0.0 (Released) |
| DolphinScheduler | Apache DolphinScheduler, a distributed and easily scalable visual workflow task scheduling platform. Supports one-click publishing of DSS workflows to DolphinScheduler. | Not supported | DolphinScheduler1.3.X (Released) |
| UserGuide | (Third-party application tool to be built into DSS) Contains help documents, a beginner's guide, dark mode skin switching, etc. | Not supported | >= DSS1.1.0 (Released) |
| DataModelCenter | (Third-party application tool to be built into DSS) Mainly provides data warehouse planning, data model development, and data asset management capabilities. Data warehouse planning includes subject domains, data warehouse hierarchies, modifiers, etc.; data model development includes indicators, dimensions, metrics, wizard-based table building, etc.; data assets are connected to Apache Atlas to provide data lineage capabilities. | Not supported | Planned in DSS1.2.0 (under development) |
| UserManager | (Third-party application tool built into DSS) Automatically initializes all user environments necessary for a new DSS user, including creating Linux users, various user paths, directory authorization, etc. | DSS0.9.1 recommended (Released) | Planned |
| Airflow | Supports publishing DSS workflows to Apache Airflow for scheduling. | PR not yet merged | Not supported |

       Because the scripts that DataSphere Studio can execute carry a high risk, and the isolation of the WeDataSphere Demo environment has not yet been completed, we have decided, given how many people are asking about the Demo environment, to first issue invitation codes to the community and accept trial applications from enterprises and organizations.

       If you want to try the Demo environment, please join the DataSphere Studio community user group and contact the team members to get an invitation code.

       DataSphereStudio Demo environment user registration page: Click me to enter

       DataSphereStudio Demo environment login page: Click me to enter

Documentation directory

       Please visit English Documentation for a complete list of DSS documents.

       The following is the installation guide for DSS-related AppConn plugins:

Who is using DataSphere Studio

       We have created a "Who is using DSS" issue to collect user feedback and record who is using DSS. You are welcome to register there.

       Since DSS was open-sourced in 2019, more than 700 companies have trialed it and 1,000+ users have tested it in the sandbox, covering industries such as finance, telecommunications, manufacturing, and the Internet.

Documentation

       The DataSphere Studio documentation is managed with GitBook, and the entire project will be organized into a GitBook e-book for everyone to download and use.

       WeDataSphere will provide a unified documentation portal in the future. For GitBook usage, please refer to the GitBook Documentation.

Contribute

       We welcome and look forward to more contributors to build DSS, whether it is code, documentation, or other forms of contribution that can help the community.

       If you have any questions or suggestions about DSS, please submit an issue so that they can be tracked, handled, and shared with others.

       You can also scan the QR code below to join our WeChat/QQ group for faster response.

Communicate

License

       DSS is under the Apache 2.0 license. See the License file for details.

dataspherestudio-doc's People

Contributors

casionone, demonray, elishazhang, fengt-en, hmhwz, jackchen0810, jinyangrao, liuyou2, ruy9527, wayson20, wushengyeyouya, yangzhiyue, yuankang134, yusiwen, zqburde

dataspherestudio-doc's Issues

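What are the environment requirements for compiling and installing?

Could this be more specific?
I see that the source code includes Scala. When I compile with Scala 3, I keep getting various depre... style warnings. Compiling with Scala 2.13.x gives similar warnings.

Could the documentation be more detailed? With only those few words to go on, it is really hard.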

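The spring-web integration specification refers to the spring-boot web package

The spring-web integration approach described in "2.1.1 Spring Web application implementation of the DSS first-level specification" uses Spring Boot's filter registration class, FilterRegistrationBean. I have an open source component that I would like to integrate, but it only depends on spring-web and does not include the spring-boot web package. How should I integrate it?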

Executing SQL through the IDE fails with "Failed to create engineResource"

2022-07-20 10:59:13.059 INFO Program is substituting variables for you
2022-07-20 10:59:13.059 INFO Variables substitution ended successfully
2022-07-20 10:59:13.059 WARN The code you submit will not be limited by the limit
Job with jobId : nodeexecution_hadoop_hive_1 and execID : nodeexecution_hadoop_hive_1 submitted
2022-07-20 10:59:13.059 INFO You have submitted a new job, script code (after variable substitution) is
SCRIPT CODE
show database
SCRIPT CODE
2022-07-20 10:59:13.059 INFO Your job is accepted, jobID is exec_id018008linkis-cg-entrancedss:9104nodeexecution_hadoop_hive_1 and taskID is 18 in ServiceInstance(linkis-cg-entrance, dss:9104). Please wait it to be scheduled
job is scheduled.
2022-07-20 10:59:13.059 INFO Your job is Scheduled. Please wait it to run.
Your job is being scheduled by orchestrator.
2022-07-20 10:59:13.059 INFO job is running.
2022-07-20 10:59:13.059 INFO Your job is Running now. Please wait it to complete.
2022-07-20 10:59:13.059 INFO Job with jobGroupId : 18 and subJobId : 18 was submitted to Orchestrator.
2022-07-20 10:59:13.059 INFO Background is starting a new engine for you,execId astJob_3_codeExec_3 mark id is mark_3, it may take several seconds, please wait
2022-07-20 10:59:14.059 ERROR Job with execId-nodeexecution_hadoop_hive_1 + subJobId : 18 execute failed,21304, Task is Failed,errorMsg: errCode: 12003 ,desc: dss:9101_3 Failed to async get EngineNode AMErrorException: errCode: 30001 ,desc: Failed to create engineResource ,ip: dss ,port: 9101 ,serviceKind: linkis-cg-linkismanager ,ip: dss ,port: 9104 ,serviceKind: linkis-cg-entrance

2022-07-20 10:59:14.059 INFO job is completed.
2022-07-20 10:59:14.059 INFO Task creation time(任务创建时间): 2022-07-20 10:59:13, Task scheduling time(任务调度时间): 2022-07-20 10:59:13, Task start time(任务开始时间): 2022-07-20 10:59:13, Mission end time(任务结束时间): 2022-07-20 10:59:14
2022-07-20 10:59:14.059 INFO Your mission(您的任务) 18 The total time spent is(总耗时时间为): 568 ms
2022-07-20 10:59:14.059 INFO Sorry. Your job completed with a status Failed. You can view logs for the reason.

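Running the verification from the command line with “sh bin/linkis-cli -submitUser hadoop -engineType hive-2.3.3 -codeType hql -code "show tables"” returns the result normally.
But when running through the DSS page, it reports that the resource cannot be created.
Could someone please take a look?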
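
The ERROR line in the log above nests two error codes (12003 and 30001) inside a single message. When searching the service logs for the root cause, it can help to pull those codes out first. The following is a minimal sketch in Python, assuming the log format shown in this issue; the function name `error_codes` and the shortened sample string are illustrative and not part of DSS or Linkis.

```python
import re

# Matches fragments such as "errCode: 12003" inside a log line.
ERR_CODE = re.compile(r"errCode:\s*(\d+)")


def error_codes(log_text):
    """Return every errCode value found on ERROR lines, in order of appearance."""
    codes = []
    for line in log_text.splitlines():
        # Only ERROR-level lines are inspected; INFO and WARN lines are skipped.
        if " ERROR " in line:
            codes.extend(ERR_CODE.findall(line))
    return codes


# Sample text shortened from the log in this issue.
sample = (
    "2022-07-20 10:59:13.059 INFO job is running.\n"
    "2022-07-20 10:59:14.059 ERROR Job with execId-nodeexecution_hadoop_hive_1 "
    "+ subJobId : 18 execute failed,21304, Task is Failed,errorMsg: "
    "errCode: 12003 ,desc: dss:9101_3 Failed to async get EngineNode "
    "AMErrorException: errCode: 30001 ,desc: Failed to create engineResource\n"
)

print(error_codes(sample))
```

The innermost code in the list is usually the most specific one to search for, since the outer codes wrap it.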
