About Speedy

AngusLooney · ‎05-14-2019

Very interesting, only skim read so far. One thing I would 100% endorse is the need for "instrumentation" on jobs, to log their execution, starts and stops. This is critical really, however you do it. Definitely recognise the issues around naming conventions and structured approaches. To me, the need over a pervasive, overarching conceptual framework rapidily becomes critical. In a way, we're dealing with a series of levels of consideration: - the sequence and interdependancy of jobs within a "flow" - the sequence and interdependancy of flows within an "estate" I've recently been looking into the ideas/concepts around decomposing ingest processes into completely decoupled stages, where bundles of data transistion through a series of states, where those transistion happen by the actions of jobs/flows, being read as input and written as output, which is then the input the downstream processes. The ideas of "data queues" and viewing instances of flows almost like "worker threads", including mutiple parallelised instances of the same flow, action on discretly allocated collections of the bundles of data. It started from looking at the challenges of ingesting very large volumes of raw data files, particularly XML where neither "by file" or "all in one go" approaches are performant or sustainable. Starts to morph into streaming territory, queues, prioritisation, "backpressure" and the like.

Patrick · ‎04-06-2019

@Speedy Data and Code migration are two very different areas. For data: Using representative data for development is important but using full volume production data for this is often not a good idea because A) full volumes slow down development (i.e. long running jobs) and B) PII/sensitive data should not be used in a Dev environment with normally much less security applied. For DIS "code"/SAS Metadata The process for moving data from one environment to another should be the similar so use what has been used to get DIS jobs into Prod in first place (if that is already a successful and established process).

LinusH · ‎04-06-2019

If metadata have been updated, analyse dependencies. Export relevant metadata into spk Import spk in tharget environment, including meta data mappings to target environemnt "global" meta data Check import log Review imported metadata Re-deploy jobs Test If you are working with releases, you probably have some kind release docementation you need to update, so you can keep track on what to lift further (UAT/prod).

Online Status	Offline
Date Last Visited	‎10-27-2021 07:35 PM

DI Studio mappings

Backup and restore of production environment

Promotion of a batch job from Dev to QA

Re: Backup and restore of production environment

Re: Promotion of a batch job from Dev to QA

Re: DI Studio mappings

Re: Backup and restore of production environment

Re: Promotion of a batch job from Dev to QA