Innocent Musanzikwa
Verified Expert in Engineering
Data Engineer and Developer
Inno是一位经验丰富的数据工程师和开发人员,在过去的十年里,他在非洲和北美的顶级零售数据分析公司iri工作,并在过去的几年里担任自由顾问. As a SQL and ETL developer, 他使用行业标准技术(如Kimball和DataVaults)创建了高质量的数据仓库. As a data engineer, Inno使用几种最新的尖端技术,在本地和云上构建了高度健壮和可扩展的数据管道.
Portfolio
Experience
Availability
Preferred Environment
SQL, PySpark, Python, Hadoop, Apache Hive, Azure Synapse, Oracle, SQL Server Integration Services (SSIS), Azure Data Factory, Data Warehousing
The most amazing...
...我设计的大数据仓库和数据集成解决方案——使用Python, SQL, ADF, Hadoop, Hive, and Spark—won an RFP in Canada out of six competitors.
Work Experience
Data Engineer
Darwill, Inc.
- 使用AWS Redshift和Aurora数据库构建Tableau仪表板和可视化.
- 为自定义ETL任务和临时请求创建运行Python的AWS Lambda函数.
- 管理AWS Redshift和Aurora数据库,设计数据仓库和数据迁移.
- 使用AWS技术栈重新设计了客户端的数据仓库,并通过引入运行Python管道的联邦查询和Lambda函数改进了他们的迁移过程, as well as overhauling their Tableau dashboards.
Data Engineer
SFL Scientific LLC
- 就现有的SSIS设计不良的数据集成项目提供咨询,并帮助确定瓶颈和低效率.
- 使用SSIS重新设计现有的数据管道,以提高效率和可扩展性.
- Performed SQL tuning and SQL code review for process efficiencies.
BI and Data Warehouse Expert
Airiam Holdings, LLC
- 设计和开发数据管道,集成来自Quickbooks API的数据, Sage Intacct API, and spreadsheets into Azure SQL.
- Designed and developed a data warehouse in Azure SQL.
- 使用Power BI设计和创建业务报告和KPI仪表板.
- 开发复杂的SQL脚本来管理数据转换和加速集成.
Data Analyst for Migration Project
JLL - JLLT Data
- 开发数据管道,将数据从Salesforce集成到Microsoft SQL.
- Designed advanced SQL code, e.g.、CTE、存储过程和管理数据转换的函数.
- 执行SQL调优以提高ETL效率和流程可伸缩性.
- Consulted on standard operating procedures and best case scenarios.
Director | Data Engineering
IRI
- 开发Azure数据工厂管道,集成来自Apache Hive的数据, HDFS, OAuth 2 APIs, and various flat-file types into Azure SQL.
- Managed a team of onshore and offshore big data developers, assigning tasks and tracking the progress on Jira.
- 监督新数据源和正在进行的项目的数据策略和建议.
- Mentored big data engineers to help them develop their skills.
- 根据客户要求或技术变更,构建新的数据模型并升级旧的数据仓库.
ETL Architect
IRI
- Developed SQL-based data warehouses on-premise and on the cloud.
- 集成了从平面文件到基于云的数据源(如Snowflake)的各种数据源, AWS and data lakes into Azure Data Warehouse, and Apache Hive on Hadoop.
- 创建了可扩展的数据管道,提高了现有管道的效率.
- 培训和提高新数据开发人员的技能,并参与代码审查.
- 维护所有业务数据组件和策略的系统文档.
SQL Lead Developer
IRI
- Developed SQL-based data warehouses and data marts.
- Wrote SQL queries to provide data for SSRS reports.
- 根据客户端需求,ETL进程使用SSIS、Talend、DataStage.
- 使用SQL Server Reporting Services (SSRS)创建自定义业务报表.
- Managed junior developers and ran stand-up development meetings.
SQL/ETL Developer and Consultant
Mi9 Retail (formerly JustEnough Software Corporation)
- Managed SQL replication between mobile devices and SQL Server.
- 使用Kimball方法为报告目的创建SQL数据仓库.
- 使用SQL Server集成服务(SSIS)设计和开发ETL包.
- 在SQL Server Reporting Services (SSRS)中设计和开发报表.
- 对部署到生产环境中的任何代码执行数据库调优和代码审查.
Experience
Data Migration from Azure SQL to Snowflake
http://github.com/innowarue/ADF我用我的Azure和Snowflake帐户替换了真实的数据源,以便在不损害机密性的情况下公开提供项目.
Data Integration from OAuth2 API
SQL Server Replication to Mobile Devices
In-place Data Integration for an Acquisition
Kafka Streaming and Data Integration
Skills
Languages
SQL, Python, Bash Script, T-SQL (Transact-SQL), Snowflake, Stored Procedure, SQL DML, Scala, JavaScript, Bash
Frameworks
Hadoop, Spark, Windows PowerShell, ADF
Libraries/APIs
PySpark, REST APIs, Spark Streaming
Tools
Microsoft Power BI, Tableau, BigQuery, Synapse, SSAS, Apache Airflow, Amazon Elastic MapReduce (EMR), Git, Google Sheets
Paradigms
ETL, Business Intelligence (BI), Dimensional Modeling, Database Development, Database Design, Data Science
Platforms
Amazon Web Services (AWS), AWS Lambda, Azure SQL Data Warehouse, Dedicated SQL Pool (formerly SQL DW), Azure, Microsoft Power Automate, Azure Synapse, Oracle, Databricks, Apache Kafka, Salesforce, Zeppelin
Storage
Apache Hive, MySQL, SQL Server Integration Services (SSIS), SQL Server Reporting Services (SSRS), PSQL, Microsoft SQL Server, SQL Stored Procedures, PostgreSQL, Databases, Data Pipelines, Data Integration, Relational Databases, Database Architecture, RDBMS, Database Modeling, Dynamic SQL, NoSQL, SQL Server DBA, Database Replication, Azure SQL, MariaDB
Other
Azure Data Factory, Data Warehousing, Data Analysis, Data Engineering, Data, Data Architecture, Big Data, Data Migration, ELT, Data Warehouse Design, Data Transformation, Database Schema Design, ETL Tools, Scripting Languages, Data Analytics, Data Visualization, SSRS Reports, SQL Server 2015, Entity Relationships, Business Analytics, Performance Tuning, Data Modeling, Cloud, APIs, Dashboard Design, Dashboards, Web Scraping, Data Build Tool (dbt), iPaaS, CI/CD Pipelines, DAX, Data Cleansing, Azure Databricks
Education
Bachelor's Degree in Information Technology
University of South Africa - Pretoria, South Africa
Certifications
Databricks Certified Data Engineer Associate
Databricks
SnowPro Core
Snowflake
Certified Apache Spark and Hadoop Developer
Cloudera
Analyzing Big Data with Hive
LinkedIn Learning
Advanced NoSQL for Data Science
LinkedIn Learning
How to Work with Toptal
在数小时内,而不是数周或数月,我们的网络将为您直接匹配全球行业专家.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring