How a Telco Redesigned Its Integration Landscape Using ArchiMate


The most interesting part of this story is not that a telco used ArchiMate.

Plenty of enterprises use ArchiMate and get almost no practical value from it. They produce diagrams, color-code dependencies, hold architecture reviews, and still cannot answer the one question the business keeps coming back to: why does the customer journey keep breaking in the same places?

This telco hit that wall hard.

An in-store order looked “completed” in CRM. Billing still showed “pending.” The retail POS had no real final state at all, just a spinner that eventually timed out or a vague success message that turned into a support ticket later. The call center saw something else again. Partner dealers had their own version of truth. E-commerce orders followed a slightly different path, which meant a slightly different failure mode. Every urgent issue became an integration war room. The integration team was drowning in interface fixes labeled critical, high priority, customer impacting, revenue impacting, regulator visible, or, quite often, all of the above.

And this was not some small operator. It had a substantial retail footprint: branded stores, partner dealers, e-commerce, call center sales, field sales. Underneath that sat the familiar accumulation of telecom history: legacy BSS and OSS platforms, CRM, product catalog, order management, provisioning stacks, billing, an ESB that had once been called strategic, APIs behind a gateway, Kafka introduced by a digital team, MQ still hanging around, and nightly batch jobs nobody was especially eager to touch.

So no, this was not an ArchiMate exercise because the architecture team wanted prettier models.

It was a redesign effort because operations were failing in ways customers could actually see.

That distinction matters. More than most architecture teams admit.

ArchiMate only became useful when the team stopped treating it as a documentation layer and started using it to expose design mistakes that delivery teams had quietly normalized.

Start with the first mistake: modeling the current mess too literally

I have seen this more than once, and, to be fair, I have made the same mistake myself. Someone says, “Let’s understand the current state,” and what follows is a giant integration diagram with every interface, every protocol, every payload transformation, every scheduled job, every middleware route, every topic, every queue, every retry path. It turns into a wallpaper-sized artifact that looks impressive for about 90 seconds.

Then nobody can reason with it.

That happened here. The team built a current-state integration map of the retail activation journey. It included POS, CRM, order capture, credit check, billing, provisioning, notification service, ESB mediation, IAM token exchanges, third-party financing checks, and downstream status updates. More than 40 flows across synchronous APIs, MQ, ESB mediation, and asynchronous notifications.

It was detailed. It was technically accurate. And it was very close to useless.

Why? Because it mixed everything together too early. Business-critical interactions sat next to low-value technical noise. A token validation step got the same visual weight as the decision about whether a customer could complete a plan purchase. A payload mapping in middleware looked as important as the application actually accountable for order orchestration. Architecture review sessions drifted into inventory management. People argued about whether an interface was REST or SOAP while nobody could answer a much more important question: who owns the customer outcome when this journey fails halfway through?

That was the real issue in the retail SIM activation journey. The old diagram showed plenty of movement but almost no accountability. You could see flows. You could not see ownership. You could not tell where orchestration lived, because, in truth, it lived nowhere cleanly. Parts sat in the POS. Parts sat in CRM. Some sequencing was hidden inside ESB flows. Retry logic gave the illusion of resilience while masking the absence of proper exception handling.

That was the first genuinely useful ArchiMate lesson, and honestly it should not even be a lesson, but teams forget it all the time: start from behavior and accountability, not just interfaces.

The model only became readable when they used a small set of elements deliberately. Business Process to represent the retail journey. Application Service to show what capabilities systems exposed in support of that journey. Application Component to show who was actually doing the work. Flow, Triggering, Serving, Assignment—used with discipline instead of flooding the page.

The practical shift was simple. For the first pass, model only the flows that affect customer outcome, SLA, and failure handling. Leave protocol and payload detail to supporting views.

That one decision cut through weeks of noise.
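That first-pass rule is mechanical enough to sketch. The Python fragment below is a toy illustration, not the telco's actual inventory: flow names and flags are hypothetical. The triage is the point — anything that does not affect customer outcome, SLA, or failure handling drops to a supporting view.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Flow:
    """One integration flow from a current-state inventory (names hypothetical)."""
    name: str
    affects_customer_outcome: bool = False
    affects_sla: bool = False
    part_of_failure_handling: bool = False

def first_pass_view(flows):
    """Keep only decision-relevant flows for the first architecture view;
    protocol and payload detail stays in supporting views."""
    return [f.name for f in flows
            if f.affects_customer_outcome
            or f.affects_sla
            or f.part_of_failure_handling]

inventory = [
    Flow("POS -> CRM: customer update", affects_customer_outcome=True),
    Flow("POS -> IAM: token validation"),                # technical noise
    Flow("OrderMgmt -> Billing: account setup", affects_sla=True),
    Flow("ESB: payload re-mapping"),                     # supporting view only
    Flow("OrderMgmt -> POS: failure notification", part_of_failure_handling=True),
]

first_view = first_pass_view(inventory)
```

Three of the five flows survive the cut; the token validation and the payload re-mapping wait for a supporting view.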

What the integration landscape looked like before redesign

The original environment was not unusual. That is almost the depressing part.

Retail POS updated customer data directly in CRM. The same POS also called order management for some fulfillment actions but bypassed it for others. CRM triggered billing setup in some product scenarios because somebody, years earlier, had decided CRM was the easiest place to hang that logic. The ESB mediated a subset of flows, but several teams routed around it because they needed speed or because the ESB team was overloaded. Newer digital apps exposed APIs through an API gateway running in the cloud. Older back-office systems still depended on MQ and batch extracts. Kafka existed, mostly because a digital platform initiative had introduced it, but in practice it functioned more as a side channel than a real integration backbone.

There was no coherent integration landscape.

There were only delivery decisions that had accumulated over time and had never been re-owned as architecture.

That distinction explained a lot of the operational pain. Offer eligibility logic had been duplicated across digital and retail channels because each team optimized for local deadlines. Order capture behavior varied by channel because one channel integrated with an “order submission” API while another enriched the payload locally and invoked a different sequence. Number portability failures were especially ugly because rollback behavior had never been designed end to end. Store staff ended up calling support or using manual workarounds that no architecture diagram captured, even though those workarounds were effectively part of the operating model.

Here is the kind of mapping that finally got people aligned — operational symptom on the left, the ownership decision hiding behind it on the right:

  • Offer eligibility behaves differently per channel → eligibility logic duplicated because each team optimized for local deadlines
  • Order capture varies by channel → one channel calls the “order submission” API, another enriches the payload locally and invokes a different sequence
  • Number portability failures are hard to unwind → rollback behavior was never designed end to end
  • Store staff fall back to manual workarounds → the workaround is effectively part of the operating model but appears on no diagram

I like mappings like this because they force a team to stop arguing from symptoms. Telco programs love blaming “legacy” in the abstract. Sometimes legacy really is the problem. Often it is simply the place where bad ownership decisions have become visible.

The second mistake: confusing integration technology with integration architecture

I’ll say this plainly: having APIs and Kafka does not mean you have an integration strategy.

Too many teams still confuse the presence of technology with the existence of architecture. API gateway? Fine. But that is not architecture. ESB? Also not architecture. Event bus? Same story. Those are implementation mechanisms. Useful ones. Sometimes essential. But they do not tell you where business behavior belongs, who owns a decision, or what should happen when a customer journey breaks in the middle.

In this telco, APIs often exposed legacy behavior without changing accountability. A “Create Order” API existed, but channel-specific enrichment still happened in three separate systems. Store pickup rules lived outside the supposed order system. Device financing logic sat partly in retail systems and partly in a specialist service. Events were published onto Kafka, but the semantics were inconsistent. One topic suggested a confirmed business milestone; another was really just a technical update masquerading as business state. The ESB contained routing and transformation, yes, but it also contained business sequencing that nobody wanted to admit was process logic.

That matters because hidden business logic in middleware is one of the classic enterprise traps. It gives the appearance of decoupling while making the architecture harder to reason about. The business process is still there—it is just buried in routes, conditional transforms, retry rules, and handlers that operations teams discover at 2 a.m.

ArchiMate helped here because it forced a separation between technology choices and application collaboration. Once the team modeled application behavior and business behavior independently from the technology layer, it became obvious where middleware was carrying business responsibility it should never have owned.

A good rule emerged from that work: if middleware contains sequencing decisions that affect customer outcome, model it explicitly as behavior. Do not leave it as implied plumbing. And if event topics represent business commitments—order accepted, provisioning completed, billing account created—treat those interactions as architecture, not infrastructure.

That sounds obvious on paper. In practice, teams avoid doing it because it exposes messy ownership. Which is precisely why it needs doing.

The workshop where the model finally became useful

There was one workshop, about halfway through the effort, when the whole thing turned.

Not because the notation got better. Because the conversation changed.

In the room were the retail operations lead, the integration lead, the CRM architect, the billing SME, the store systems owner, and the enterprise architect facilitating the session. A few delivery leads joined remotely. This was one of those sessions where people arrived expecting another interface review. Instead, the facilitator put up a business process view for a single journey: a new mobile plan with device financing purchased in store.

That was the right move.

No broad “target architecture” speech. No twenty-system landscape slide. Just one customer journey with money, regulation, channel pressure, and downstream complexity attached to it.

The session moved through four views.

First, a business process view of the retail order journey: offer selection, customer validation, credit decision, consent capture, order submission, fulfillment, status confirmation. Then an application cooperation view showing which systems participated in each step. After that, a service realization view to map which application services actually supported each business step. Finally—and this is the part teams usually skip—an exception view showing what happened when financing approval failed, when provisioning was delayed, when billing account creation succeeded but subscriber migration did not.

That last view changed the tone in the room. Suddenly the architecture was not about idealized flow. It was about operational truth.

And the key insight landed almost embarrassingly fast: there was no single application service representing “retail order submission.”

Everyone had been talking as if that existed. It didn’t.

The store application created the illusion of a coherent submission capability by sequencing multiple calls behind the screen flow. In other words, the channel app was compensating for missing domain structure.
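That anti-pattern is easy to make concrete with a toy sketch. Everything below is hypothetical — no real system names, no real API — but the shape is the one the team found: the “submission capability” exists only as screen-flow sequencing, and the only failure the screen knows how to handle is a declined credit check.

```python
# Before-state sketch (hypothetical): the POS screen flow fakes a
# "retail order submission" service by sequencing downstream calls itself.
calls = []

def crm_update(order):
    calls.append("crm_update")

def credit_check(order):
    calls.append("credit_check")
    return True  # pretend approval

def billing_setup(order):
    calls.append("billing_setup")

def subscriber_update(order):
    calls.append("subscriber_update")

def provision(order):
    calls.append("provision")

def pos_checkout_screen(order):
    """The channel app compensates for missing domain structure.
    If billing_setup succeeds but provision fails, there is no owner
    and no compensation path -- just a spinner and a support ticket."""
    crm_update(order)
    if not credit_check(order):
        return "declined"
    billing_setup(order)
    subscriber_update(order)
    provision(order)
    return "success"

result = pos_checkout_screen({"basket": "5G plan + device"})
```

Five sequenced calls behind one button, and none of them belong to the channel.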

Once people saw that, the redesign target shifted. The question was no longer “how do we modernize these interfaces?” It became “which system should actually own this behavior?”

That is a much better architecture question.

Here’s a simplified sketch of the before-state problem.

Diagram 1: simplified before-state integration sketch

Nothing in that picture tells you where the process owner is, because there isn’t one in any clean sense. That was exactly the point.

The redesign principle that mattered most

A lot of sensible principles came out of the redesign, but one mattered more than the rest:

Move orchestration to where accountability belongs.

Before redesign, channel applications behaved like process coordinators. The POS knew too much. It gathered basket details and customer consent, which is fine, but it also kicked off sequencing across credit, billing, subscriber state changes, provisioning, and notifications. Middleware then compensated for weak domain boundaries by transforming payloads and trying to glue the process back together. CRM had become a kind of accidental workflow participant. Billing was being asked questions it should never have owned. The result was brittle coupling everywhere.

The target state was more disciplined.

Order management would own order orchestration. Product catalog would own commercial structure and offer rules. CRM would own customer profile and relationship data, not sales workflow. Billing would own charging setup and account charging configuration, not commercial eligibility. Channel apps would focus on interaction, capture, and customer-facing responsiveness.

That sounds neat because written architecture always sounds neat. Reality was messier than that. The team still had to deal with legacy subscriber platforms, IAM token propagation across old and new stacks, and cloud-native digital services sitting alongside systems that barely understood APIs. But the ownership model became clearer, and once that happened, design tradeoffs became much easier to make.

For retail specifically, the principle had obvious consequences. The POS should capture sale context, identity checks, customer consent, selected basket, maybe even immediate payment authorization. It should not be responsible for sequencing provisioning, billing creation, financing checks, and customer notification itself. Where real-time responses were needed—credit decision, eligibility confirmation—that could still happen synchronously. But synchronous did not have to mean channel-owned orchestration.
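As a rough sketch of that target channel contract — field names are illustrative, not the telco's actual API — the POS captures the sale and hands over a single business request, then stops sequencing:

```python
from dataclasses import dataclass, field

@dataclass
class RetailOrderRequest:
    """What the POS captures and submits once (fields are illustrative)."""
    sale_context: str                 # store id, rep id, channel
    customer_ref: str                 # identity reference, not a profile copy
    selected_offer: str
    consents: list = field(default_factory=list)
    financing_intent: bool = False

def pos_submit(request, order_management):
    """One submission to the owning system; sequencing billing,
    provisioning and notifications is no longer the channel's job."""
    return order_management(request)

order_id = pos_submit(
    RetailOrderRequest(
        sale_context="store-042/rep-7",
        customer_ref="cust-123",
        selected_offer="5G-plan-L",
        consents=["credit-check", "marketing"],
    ),
    order_management=lambda req: "ORD-0001",  # stub for the owning system
)
```

The stub stands in for order management; the design point is that the channel's responsibility ends at the request boundary.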

This is where ArchiMate’s basic relationships were surprisingly useful. Assignment clarified who performed behavior. Serving clarified what service supported what process step. Realization showed how application services were implemented. Application collaboration made the necessary cooperation visible without pretending that every interaction implied shared ownership.

One practical question helped in almost every review: is this integration flow primarily data movement, decisioning, or orchestration?

If the answer was “a bit of all three,” there was usually a design problem hiding there.

A concrete before-and-after: prepaid-to-postpaid migration in retail

This was one of the clearer scenarios because it touched enough domains to expose the flaws.

Before redesign, a store rep initiated a prepaid-to-postpaid migration in the POS. The POS called CRM to update customer profile details, a credit check service for postpaid eligibility, a legacy subscriber management platform to change subscriber state, billing to establish a postpaid account, and provisioning to activate relevant service features. The ESB transformed payloads between formats, retried failures, and in some cases invoked follow-up actions. If asynchronous failures occurred, they surfaced by email, overnight reconciliation reports, or a support dashboard that store staff never looked at.

So what happened? Partial completion. A lot of it.

The customer left the store believing the migration was done. Billing might be active while provisioning lagged. Or subscriber migration succeeded while financing attachment failed. Reversals often required back-office intervention because compensation logic was either absent or trapped in support runbooks rather than modeled into the architecture.

After redesign, the flow became far more coherent.

The POS submitted a single business request to order management. That request contained the sale context, customer identity reference, selected offer, financing intent if relevant, and captured consents. Order management orchestrated downstream domain services. Product catalog and eligibility services were invoked where needed. Billing setup, subscriber migration, provisioning, and notifications happened as part of an order-owned process. Kafka was still used, but for status propagation and domain events rather than as a shadow orchestration engine. Exception handling and compensation paths were modeled as first-class architecture behavior, not left for testing or production support to discover later.
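The order-owned process with first-class compensation can be sketched as a minimal saga-style loop. The step and undo functions below are stand-ins, not the telco's implementation; the shape is what matters — when a step fails, completed steps are compensated in reverse order instead of leaving the customer with a half-done migration.

```python
def run_order(steps):
    """Execute steps in sequence; on failure, compensate completed
    steps in reverse order and report where the process stopped."""
    done = []
    for name, do, undo in steps:
        if do():
            done.append((name, undo))
        else:
            for _, compensate in reversed(done):
                compensate()
            return ("failed_at", name)
    return ("completed", [n for n, _ in done])

log = []
steps = [
    ("billing_account",
     lambda: log.append("billing+") or True,
     lambda: log.append("billing-")),
    ("subscriber_migration",
     lambda: False,                        # simulate the migration failing
     lambda: log.append("migration-")),
    ("provisioning",
     lambda: log.append("prov+") or True,
     lambda: log.append("prov-")),
]

outcome = run_order(steps)
```

With the simulated failure, billing setup is rolled back and provisioning is never attempted — exactly the behavior that used to live only in support runbooks.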

Here is the simplified after-state shape.

Diagram 2: simplified after-state integration shape

Notice what changed. Not every synchronous interface disappeared. That would be simplistic and wrong. Retail still needed real-time eligibility and credit decisions because a customer was standing at the counter. Some payment authorization patterns remained synchronous too. The point was not to eliminate sync. The point was to make the authoritative process live somewhere accountable and observable.

The ArchiMate views made three things visible that had previously been blurred together:

  • where the authoritative process lived
  • where application services were duplicated or faked by channels
  • where events should be informative versus commanding

That last point is subtle and important. In many event-driven programs, teams start publishing events as if that by itself creates decoupling. It doesn’t. Some events are status propagation. Good. Some are effectively hidden commands. Dangerous. If an event is the only thing telling another domain to perform a required business action, then you need to be much more explicit about service ownership, contract semantics, idempotency, and failure behavior. The telco learned that the hard way.
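A minimal sketch of that discipline, with hypothetical event names and shapes: informative events only record status, while command-like events need an explicit contract and idempotent handling so that a replayed event cannot create a second billing account.

```python
processed = set()          # event ids already handled
billing_accounts = []      # side effects of command-like events

def handle(event):
    """Idempotent consumer: replaying the same event id must not
    repeat the business action it triggers."""
    if event["id"] in processed:
        return "duplicate_ignored"
    processed.add(event["id"])

    if event["type"] == "order.accepted":
        # Informative: status propagation only, safe to observe.
        return "status_recorded"
    if event["type"] == "billing.account.create":
        # Hidden command: another domain is being told to act,
        # so ownership, contract and failure behavior must be explicit.
        billing_accounts.append(event["order"])
        return "account_created"
    return "unknown"

r1 = handle({"id": "e1", "type": "billing.account.create", "order": "ORD-1"})
r2 = handle({"id": "e1", "type": "billing.account.create", "order": "ORD-1"})  # replay
```

The replay is absorbed: one billing account, not two. Without the idempotency check, a retried topic delivery silently doubles the side effect.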

The ugly middle: where ArchiMate models usually go stale

This part deserves some honesty because enterprise architecture articles often skip it.

The redesign workshop was not the end of the hard part. It was the beginning of the ugly middle, where the model risks becoming polished fiction.

That happened here too.

The first target-state model looked clean, but it underplayed dealer channel variations. It also barely modeled returns and cancellations, which in retail telco are exactly the scenarios that expose whether your process ownership is real or just performative. Batch reconciliation remained almost invisible until integrated testing started surfacing mismatches between order states and billing states. Some teams quietly reverted to local shortcuts because the target services were not delivered in time and commercial deadlines did not move.

That is normal, by the way. Frustrating, but normal.

The response was not to build more diagrams. It was to maintain a smaller set of decision-critical views and use them aggressively in governance.

One for retail operations, focused on customer journey and failure states.

One for integration engineering, showing application collaboration and pattern choice—API, event, batch, file, MQ, whatever was actually in play.

One for funding and governance, linking target capabilities and transition stages so executives could see why some dependencies had to be sequenced.

One for production support, mapping dependencies and handoffs clearly enough that incident triage did not start from tribal memory.

That “one view per stakeholder argument” idea worked far better than a giant repository nobody updated. In my experience, architecture repositories fail less because of tooling and more because they try to be encyclopedias instead of decision instruments.

And model failure paths early. Really early. If you wait for testing, you have already pushed architecture responsibility downstream.

The third mistake: treating canonical data model as a cure-all

This one deserved its own intervention because the team lost a lot of time to it.

At first, they went after the classic enterprise dream: one canonical customer, one canonical order, one canonical product. On paper, that sounds disciplined. In practice, it triggered endless mapping debates and slowed delivery. Worse, it created fake consistency across domains with genuinely different semantics.

Retail made the problem obvious. A device bundle, mobile plan, accessories, insurance product, and financing agreement are related in the customer journey, but they are not the same thing semantically. Trying to force all of them into one universal “product order” structure made integrations brittle. Every new proposition required somebody to stretch the canonical shape again, which meant more transformations, more optional attributes, more hidden assumptions, and more disagreement.

The redesign took a more pragmatic approach. Canonical patterns were used only where they genuinely reduced friction, especially at stable interaction points. Domain semantics remained explicit. Published contracts were tied to application services rather than to an abstract universal model. Shared meaning was modeled where it mattered—customer identity references, order identifiers, fulfillment status milestones—but the architecture stopped pretending that every object needed to be enterprise-universal.

ArchiMate’s object and relationship concepts were enough to make this visible without overcomplicating it. The lesson was not anti-standardization. It was anti-fantasy.

Standardize interaction points before you over-standardize data structures.
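A small sketch of the difference, with purely illustrative types: each domain keeps its own semantics, and only the shared identifiers at the interaction point are standardized — no universal “product order” structure forced over both.

```python
from dataclasses import dataclass

@dataclass
class DeviceFinancingAgreement:
    """Finance-domain shape (illustrative)."""
    order_id: str
    customer_ref: str
    monthly_amount: float

@dataclass
class MobilePlanOrder:
    """Order-domain shape (illustrative)."""
    order_id: str
    customer_ref: str
    plan_code: str

# The standardized interaction point: shared identifiers, nothing more.
SHARED_KEYS = ("order_id", "customer_ref")

def correlated(a, b):
    """Domain objects correlate on shared identifiers at the interaction
    point, not through a forced enterprise-universal structure."""
    return all(getattr(a, k) == getattr(b, k) for k in SHARED_KEYS)

fin = DeviceFinancingAgreement("ORD-9", "cust-123", 29.90)
plan = MobilePlanOrder("ORD-9", "cust-123", "5G-plan-L")
```

The two objects stay semantically distinct; the only thing the enterprise standardizes is how they refer to the same customer and order.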

I feel strongly about that because I have watched more than one transformation get stuck in canonical model theology while stores and channels continue shipping workaround logic into production.

How the target integration landscape was represented in ArchiMate

The target state did not live in one grand master diagram. That would have failed for the same reason the original current-state wallpaper failed.

The useful model set was smaller and more specific.

A capability-to-application support view showed which applications supported sales, order handling, customer management, charging, provisioning, and retail operations capabilities. This was important in funding conversations because it linked platform investment to business capability outcomes rather than to technical modernization language.

A retail order journey business/application alignment view mapped the customer journey steps to supporting application services. This was the one retail operations people could actually use.

An application cooperation view centered on order orchestration showed which components collaborated, where service boundaries sat, and where channel apps stopped owning process sequencing.

An integration pattern view distinguished API, event, and batch interactions. This mattered because not every legacy dependency could be cleaned up immediately. Some batch patterns remained for reconciliation or low-frequency updates, and pretending otherwise would have been dishonest.

A transition-state view showed phased decoupling from legacy systems. That one saved a lot of pointless arguments. Architects often present target-state purity; delivery leaders need migration reality.

Some of the modeled services were straightforward: eligibility check, order submission, customer update, payment authorization, billing account setup, provisioning activation. Event flows represented milestones like order accepted, provisioning completed, billing account created, financing approved, migration failed. The technology layer only appeared where it changed a decision—for example, where Kafka was the chosen pattern for status propagation, where cloud-native API management affected non-functional constraints, or where IAM token exchange shaped trust boundaries across applications.

That last point is worth highlighting. IAM often gets treated as a technical sidecar. In channel-heavy telco landscapes, it absolutely influences architecture. Identity propagation, delegated authorization, customer consent traceability, and partner channel trust are not implementation details when they affect process design and supportability.

And there is one practical rule I wish more teams followed: don’t use ArchiMate notation to impress people. Use it to reduce ambiguity. If a model needs ten minutes of legend explanation before anyone can read it, the model is failing.

What changed for retail delivery teams after the redesign

The redesign did not produce miracles. Legacy constraints did not vanish. Some old BSS components still behaved exactly as badly as before. A few brittle integrations remained. Transition states were awkward.

But the operating experience improved in visible ways.

New offers no longer required quite so many channel-specific integrations. Store, app, and call center channels began sharing more consistent order status because status propagation followed a clearer model. Root-cause analysis got faster because the architecture made dependencies and ownership more explicit. Support teams depended less on the one ESB veteran who knew where the hidden routing rules lived. That alone was worth a lot.

The delivery impact was concrete. Launching a new handset financing offer still required coordination across retail, finance, billing, and order management, but it no longer demanded bespoke sequencing logic in the POS. Click-and-collect from retail inventory became easier to reason about because order ownership and inventory confirmation boundaries were clearer. Returns that touched billing and financing remained painful—returns are always where architecture gets judged—but at least the responsible domains and compensation paths were visible.

Governance changed too. Integration reviews shifted away from “does this meet our interface standards?” toward “does this align with target responsibilities and process ownership?” That is a healthier conversation. Interface standards matter, but they are downstream of the more fundamental architecture choices.

Guardrails became more practical. Channel teams were allowed to optimize UX responsiveness, but not to absorb orchestration logic by stealth. Middleware teams could implement mediation and resilience patterns, but not hide business policy. Event producers had to define business semantics, not just schemas. Those are the kinds of governance changes that actually stick because they tie architecture decisions to delivery behavior.

Lessons I would apply earlier next time

A few things I would do sooner, with less hesitation.

Model one painful customer journey first. Not a generalized process map. A real one with edge cases, revenue impact, and operational scars.

Put exception handling on the architecture diagram, not only in test cases. If it matters in production, it belongs in the model.

Do not let middleware hide business logic. If it sequences, decides, or compensates, expose it.

Separate channel UX speed needs from core process ownership. Real-time is not the same thing as channel-owned orchestration.

Use ArchiMate views as decision tools, not repository decoration.

Accept hybrid integration patterns during transition. Some APIs, some events, some batch. Purity is overrated when migration reality is messy.

And keep retail edge cases in scope early. Stores, dealers, returns, financing, portability failures—those scenarios expose weak boundaries faster than any architecture principles deck ever will.

One hard truth: if stores rely on manual fallback, your architecture is part of the operating model whether you admit it or not.

I have seen teams treat manual workarounds as temporary noise. They rarely are. More often, they are the clearest signal that process ownership is unresolved.

A compact anti-pattern checklist for telco integration redesign

If I were reviewing another telco retail transformation tomorrow, I would start with this checklist:

  • POS or digital channel orchestrates end-to-end fulfillment
  • CRM owns responsibilities beyond customer profile and relationship context
  • ESB or middleware contains hidden business rules
  • Event topics are being used instead of clear service contracts
  • Canonical data modeling is stalling delivery
  • Exception flows are absent from architecture views
  • Target-state diagrams ignore transition architecture
  • Retail and dealer channels are treated as operationally identical
  • Kafka topics carry ambiguous business semantics
  • IAM and consent propagation are treated as afterthoughts
  • Production support has no dependency view usable during incidents

If several of those are true, the integration problem is usually architectural, not merely technical.

ArchiMate did not solve the integration problem

It made the real problem impossible to ignore.

That was the value.

Not notation purity. Not framework compliance. Not a nicer repository.

The telco got value from ArchiMate when it used the language to surface ownership, behavior, and coupling clearly enough that redesign decisions could be made responsibly. Once the team could see where orchestration lived, where services were duplicated, where middleware was acting like an application, and where customer-facing failures had no modeled recovery path, the conversation became more honest.

And in multi-channel telco, honesty arrives quickly because integration failures show up at the counter, in front of the customer, with a store rep waiting for the screen to stop spinning.

That is a healthy kind of pressure, actually. It forces architecture to answer for operations.

So my pragmatic conclusion is this: if the model helps teams decide where process ownership belongs and what should integrate versus collaborate, it is doing its job. If it only documents complexity, it is just a prettier version of the problem.

FAQ: Questions architects usually ask after seeing this kind of redesign

Why use ArchiMate instead of BPMN plus interface catalogs?

Because BPMN is strong for process detail and interface catalogs are useful reference artifacts, but neither gives you the same cross-layer language for business behavior, application services, components, and technology choices in one coherent set of views. You do not need ArchiMate for everything, but it is very effective when the problem is unclear ownership across layers.

How much detail is enough for integration views?

Enough to support a decision. That sounds evasive, but it is true. For architecture work, model customer-impacting flows, owning components, service boundaries, and failure paths first. Protocols, payload schemas, retries, and field-level mappings belong in supporting artifacts unless they materially change architecture choices.

Where should event-driven patterns be modeled in ArchiMate?

Primarily in application cooperation and flow-oriented views, with technology layer detail only when the platform choice changes a design decision. The key is to model business semantics, not just topic plumbing.

How do you handle legacy systems that cannot expose clean application services?

Be honest. Model adapter or façade patterns explicitly. Sometimes an integration layer has to compensate temporarily. The trick is to show that as transition architecture, not as the target end state.

What is the minimum useful model set for a telco retail transformation?

I would start with four views: customer journey business/application alignment, order orchestration cooperation, integration pattern view, and transition-state roadmap. Add capability mapping if funding alignment is a problem—and it usually is.
