Giter Site home page Giter Site logo

azure / data-product-analytics Goto Github PK

View Code? Open in Web Editor NEW
72.0 18.0 24.0 11.42 MB

Template to deploy a Data Product for analytics and data science use-cases into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to create insights and products for external users.

License: MIT License

PowerShell 7.48% Dockerfile 0.56% Shell 20.06% Bicep 71.90%
arm azure architecture data-platform enterprise-scale policy-driven data-mesh data-fabric bicep enterprise-scale-analytics

data-product-analytics's Issues

Compute Cluster Deployment Fails for Policy DNS A-record Deployment

Describe the bug
If the user deploys this template, the CPU Cluster and GPU Cluster deployments fails, with the following error message:

{
    "status": "Failed",
    "error": {
        "code": "UserError",
        "message": "{\n  \"error\": {\n    \"code\": \"UserError\",\n    \"severity\": null,\n    \"message\": \"Provided EnableNodePublicIp 'False' requires workspace private endpoint.\",\n    \"messageFormat\": null,\n    \"messageParameters\": null,\n    \"referenceCode\": null,\n    \"detailsUri\": null,\n    \"target\": null,\n    \"details\": [],\n    \"innerError\": null,\n    \"debugInfo\": null,\n    \"additionalInfo\": null\n  },\n  \"correlation\": {\n    \"operation\": \"\",\n    \"request\": \"\"\n  },\n  \"environment\": \"eastus2\",\n  \"location\": \"eastus2\",\n  \"time\": \"2021-08-02T12:17:49.9687945+00:00\",\n  \"componentName\": \"machinelearningcompute\"\n}",
        "details": []
    }
}

Steps to reproduce

  1. Deploy this template.
  2. See how it fails.

Screenshots

Documentation: about section

Documentation Issue

There doesn't seem to be a way to submit a change in the 'about' section. See below changes.

Template to deploy a Data Product for analytics and data science use-cases into a Data Landing Zone of the Cloud-Scale Analytics Scenario, (former Enterprise-Scale Analytics, Data Management & Analytics Scenario). The Data Product template can be used by cross-functional teams to create insights and products for external users.

Code of Conduct

Update Portal Experience

  • Remove VS Code reference for CIs
  • Add option to connect Container Registry
  • Change default option for DNS A-records (deployed through Policy)

Name change

Please change all instances of "Enterprise-scale analytics and ai" to "Data Management and Analytics Scenario" to align to marketing ask. This should happen for one-clicks and all documentation.

Wrong Name for Container Build Compute

Describe the bug
The bicep templates reference the wrong name for the container build compute. This needs to be fixed to be able to build images in the customer environment.

Steps to reproduce

  1. Deploy Data Product Analytics through any of the of the available options
  2. Try to build a container image in AML
  3. Wrong compute name is referenced.

Screenshots
No Screenshots

Feature Request

Describe the solution you'd like
In the prereqs we need more clarity on enterprise landing zone for data being optional to deploy the asset and also need clarity on the min number of assets that needs to be deployed for deploying AML. For instance we just need data factory to deploy along with AML and do not need synapse, cog services and so on..

Feature Request: Add Information about Service Endpoint Requirements

Describe the solution you'd like
Please add to the docs as well as the Portal experience information about the service endpoint requirements of Azure Machine Learning. Azure Machine Learning and the computes require the Storage Service Endpoint to be enabled on the subnet that is used.

"Deployment 'synapse001' could not be found" error

Describe the bug
When deploying this solution with the processingService parameter set to "dataFactory", you receive a deployment error:
Deployment 'synapse001' could not be found.

Steps to reproduce

  1. Set processingService to "dataFactory"
  2. Run the deployment
  3. Get the error
  4. ???
  5. Profit!!!

Screenshots

(Pull request to fix this is forthcoming.)

Bug: AzureML cannot access images in Azure Container Registry with Quarantine flag

Deployment Mode

Deploy To Azure

Steps to reproduce

  1. Deploy solution
  2. Run AzureML job or Deployment online-endpoint - samples: Samples/SDK v2/sdk/python/endpoints/online/managed/online-endpoints-simple-deployment.ipynb

Error Message

(None) ResourceNotFound: Deployment failed due to timeout while waiting for Environment Image to become available. Check Environment Build Log in Workspace or Workspace storage for potential failures. Environment info: Name: CliV2AnonymousEnvironment, Version: f53727d1df8137354bc0f2c69d2619ee, Build log may be available at (log Please see troubleshooting guide, available here: https://aka.ms/oe-tsg#error-resourcenotfound Code: None

Screenshots

No response

Code of Conduct

ACTION REQUIRED: Microsoft needs this private repository to complete compliance info

There are open compliance tasks that need to be reviewed for your data-product-analytics repo.

Action required: 4 compliance tasks

To bring this repository to the standard required for 2021, we require administrators of this and all Microsoft GitHub repositories to complete a small set of tasks within the next 60 days. This is critical work to ensure the compliance and security of your Azure GitHub organization.

Please take a few minutes to complete the tasks at: https://repos.opensource.microsoft.com/orgs/Azure/repos/data-product-analytics/compliance

  • The GitHub AE (GitHub inside Microsoft) migration survey has not been completed for this private repository
  • No Service Tree mapping has been set for this repo. If this team does not use Service Tree, they can also opt-out of providing Service Tree data in the Compliance tab.
  • No repository maintainers are set. The Open Source Maintainers are the decision-makers and actionable owners of the repository, irrespective of administrator permission grants on GitHub.
  • Classification of the repository as production/non-production is missing in the Compliance tab.

You can close this work item once you have completed the compliance tasks, or it will automatically close within a day of taking action.

If you no longer need this repository, it might be quickest to delete the repo, too.

GitHub inside Microsoft program information

More information about GitHub inside Microsoft and the new GitHub AE product can be found at https://aka.ms/gim or by contacting [email protected]

FYI: current admins at Microsoft include @marvinbuss, @daltondhcp, @esbran

Bug: Alerts.bicep not alerting

Deployment Mode

Azure DevOps

Steps to reproduce

When deployed the new logging elements, for example Synapse pipeline errors did not trigger any alerts.

In the script, the 'static threshold' operator is "GreaterThan 1" when that likely should be "GreaterThanOrEqual 1".

AFAIK, this exists in other products' Alerts.bicep files as well.

Error Message

No response

Screenshots

No response

Code of Conduct

AML Compute Deployments fail

Describe the bug
The AML Compute Cluster and Compute Instance deployments fail, when deploying to a region, where the 'no public IP' feature is not available. Hence, we have to change the configuration depending on the selected region of the deployment.

Steps to reproduce

  1. Deploy Data Product via ADO, GitHub or Portal
  2. Select a region like "UK South" where the feature is not supported
  3. Deployment fails.

Screenshots
image

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.