Data Lake & KNIME Hub

Your gateway to Canon's IDF data lake. Explore Oracle EBS, Orion, and custom tables — build KNIME workflows without writing code, or use SQL when you need precision.

Data Lake Architecture


┌────────────────────────────────────────────────────────────────────────────────────┐
│                           CANON IDF DATA LAKE (Azure)                               │
├────────────────────────────────────────────────────────────────────────────────────┤
│                                                                                    │
│  ┌──────────────────┐   ┌──────────────────┐   ┌──────────────────┐              │
│  │   ORACLE EBS     │   │     ORION        │   │  CUSTOM TABLES   │              │
│  │   (R12 Replica)  │   │  (Tax Engine)    │   │  (IDF Project)   │              │
│  │                  │   │                  │   │                  │              │
│  │ • AP Invoices    │   │ • Tax Rules      │   │ • E-Invoice Log  │              │
│  │ • AR Invoices    │   │ • Tax Rates      │   │ • Country Config │              │
│  │ • GL Journals    │   │ • Registrations  │   │ • Validation     │              │
│  │ • Tax Lines      │   │ • Determinations │   │ • Mapping Tables │              │
│  │ • Suppliers      │   │ • Exemptions     │   │ • Audit Trail    │              │
│  │ • Customers      │   │ • Reporting      │   │ • Error Log      │              │
│  │ • Org Structure  │   │                  │   │                  │              │
│  └────────┬─────────┘   └────────┬─────────┘   └────────┬─────────┘              │
│           │                      │                      │                         │
│           └──────────────────────┼──────────────────────┘                         │
│                                  │                                                 │
│  ┌───────────────────────────────┴────────────────────────────────────┐           │
│  │              AZURE SQL DATABASE / SYNAPSE ANALYTICS                  │           │
│  │              (Queryable via KNIME Database Connector)                │           │
│  └───────────────────────────────┬────────────────────────────────────┘           │
│                                  │                                                 │
└──────────────────────────────────┼─────────────────────────────────────────────────┘
                                   │
                    ┌──────────────┴──────────────┐
                    │   KNIME BUSINESS HUB        │
                    │   (Standard Edition)        │
                    │                             │
                    │   • Shared Workflows        │
                    │   • Scheduled Execution     │
                    │   • Team Collaboration      │
                    │   • Version Control         │
                    └─────────────────────────────┘
        

KNIME Business Hub Configuration

Environment Details

KNIME VersionLTS 5.8.2
Hub EditionStandard
DatabaseAzure SQL / Synapse
JDBC DriverMicrosoft JDBC 12.x
AuthenticationAzure AD / SQL Auth

Recommended Node Extensions

  • KNIME Database — DB Connector, DB Table Selector, DB Reader, DB Query, DB Writer
  • KNIME Microsoft SQL Server — Azure SQL connector preset
  • KNIME Excel Integration — Export results to Excel
  • KNIME Reporting — Generate formatted reports
  • KNIME Python Integration — Advanced transformations (optional)

Database Connection Setup (One-Time)

KNIME Visual Flow


┌─────────────────┐     ┌─────────────────┐
│  Microsoft SQL  │────▶│  DB Table       │
│  Server         │     │  Selector       │
│  Connector      │     │                 │
│                 │     │  (Pick table)   │
│  [Config below] │     └────────┬────────┘
└─────────────────┘              │
                                 ▼
                        ┌─────────────────┐
                        │  DB Reader      │
                        │                 │
                        │  (Load to KNIME │
                        │   data table)   │
                        └─────────────────┘
                

Connector Node Settings

NodeMicrosoft SQL Server Connector
Hostname[your-server].database.windows.net
Port1433
DatabaseIDF_DataLake
AuthenticationCredential (flow variable)
EncryptionTLS 1.2 (required for Azure)
Timeout30 seconds

Best Practice: Store credentials in KNIME Credential Configuration node or Hub secrets — never hardcode in workflow.

Design Philosophy

Database Nodes First

Prefer KNIME Database nodes (processing happens in the database) for speed and scalability. Only pull data into KNIME when you need local transformations.

Visual Over Code

Every workflow is designed to be understood by non-IT users. SQL is available for power users and testing, but the visual flow is the primary interface.

VAT & CIT Focused

All queries, joins, and workflows are designed through the lens of indirect tax (VAT) and corporate income tax (CIT) use cases.