Data Lake & KNIME Hub
Your gateway to Canon's IDF data lake. Explore Oracle EBS, Orion, and custom tables — build KNIME workflows without writing code, or use SQL when you need precision.
Data Lake Architecture
┌────────────────────────────────────────────────────────────────────────────────────┐
│ CANON IDF DATA LAKE (Azure) │
├────────────────────────────────────────────────────────────────────────────────────┤
│ │
│ ┌──────────────────┐ ┌──────────────────┐ ┌──────────────────┐ │
│ │ ORACLE EBS │ │ ORION │ │ CUSTOM TABLES │ │
│ │ (R12 Replica) │ │ (Tax Engine) │ │ (IDF Project) │ │
│ │ │ │ │ │ │ │
│ │ • AP Invoices │ │ • Tax Rules │ │ • E-Invoice Log │ │
│ │ • AR Invoices │ │ • Tax Rates │ │ • Country Config │ │
│ │ • GL Journals │ │ • Registrations │ │ • Validation │ │
│ │ • Tax Lines │ │ • Determinations │ │ • Mapping Tables │ │
│ │ • Suppliers │ │ • Exemptions │ │ • Audit Trail │ │
│ │ • Customers │ │ • Reporting │ │ • Error Log │ │
│ │ • Org Structure │ │ │ │ │ │
│ └────────┬─────────┘ └────────┬─────────┘ └────────┬─────────┘ │
│ │ │ │ │
│ └──────────────────────┼──────────────────────┘ │
│ │ │
│ ┌───────────────────────────────┴────────────────────────────────────┐ │
│ │ AZURE SQL DATABASE / SYNAPSE ANALYTICS │ │
│ │ (Queryable via KNIME Database Connector) │ │
│ └───────────────────────────────┬────────────────────────────────────┘ │
│ │ │
└──────────────────────────────────┼─────────────────────────────────────────────────┘
│
┌──────────────┴──────────────┐
│ KNIME BUSINESS HUB │
│ (Standard Edition) │
│ │
│ • Shared Workflows │
│ • Scheduled Execution │
│ • Team Collaboration │
│ • Version Control │
└─────────────────────────────┘
Table Network Map
Visual overview of all table relationships in the data lake — Oracle EBS, Orion, and custom tables with their connections.
Table Explorer & Search
Find data in the data lake using natural language. Discover how tables connect via primary and foreign keys.
KNIME Workflows
Pre-built KNIME workflow blueprints with node-by-node visual guides. Designed for non-IT users on KNIME LTS 5.8.2.
SQL Library
Reusable DB Query node SQL for testing, validation, and advanced use cases. Copy-paste ready.
Orion & ONESOURCE (Tax Engines)
Deep dive into the Azentio Orion and Thomson Reuters ONESOURCE integration. ZX_LINES as single source of truth, anti-duplication patterns, and config-only use of Orion tables.
KNIME Business Hub Configuration
Environment Details
| KNIME Version | LTS 5.8.2 |
| Hub Edition | Standard |
| Database | Azure SQL / Synapse |
| JDBC Driver | Microsoft JDBC 12.x |
| Authentication | Azure AD / SQL Auth |
Recommended Node Extensions
- ●KNIME Database — DB Connector, DB Table Selector, DB Reader, DB Query, DB Writer
- ●KNIME Microsoft SQL Server — Azure SQL connector preset
- ●KNIME Excel Integration — Export results to Excel
- ●KNIME Reporting — Generate formatted reports
- ●KNIME Python Integration — Advanced transformations (optional)
Database Connection Setup (One-Time)
KNIME Visual Flow
┌─────────────────┐ ┌─────────────────┐
│ Microsoft SQL │────▶│ DB Table │
│ Server │ │ Selector │
│ Connector │ │ │
│ │ │ (Pick table) │
│ [Config below] │ └────────┬────────┘
└─────────────────┘ │
▼
┌─────────────────┐
│ DB Reader │
│ │
│ (Load to KNIME │
│ data table) │
└─────────────────┘
Connector Node Settings
| Node | Microsoft SQL Server Connector |
| Hostname | [your-server].database.windows.net |
| Port | 1433 |
| Database | IDF_DataLake |
| Authentication | Credential (flow variable) |
| Encryption | TLS 1.2 (required for Azure) |
| Timeout | 30 seconds |
Best Practice: Store credentials in KNIME Credential Configuration node or Hub secrets — never hardcode in workflow.
Design Philosophy
Database Nodes First
Prefer KNIME Database nodes (processing happens in the database) for speed and scalability. Only pull data into KNIME when you need local transformations.
Visual Over Code
Every workflow is designed to be understood by non-IT users. SQL is available for power users and testing, but the visual flow is the primary interface.
VAT & CIT Focused
All queries, joins, and workflows are designed through the lens of indirect tax (VAT) and corporate income tax (CIT) use cases.