Pipeline Thinking
From ingestion to insight
I approach projects as connected workflows, not isolated scripts, with attention to extraction, transformation, validation, and output quality.
Open to Data Engineering Intern roles
Ho Chi Minh City, Vietnam
Data Engineering Intern / Junior Data Engineer focused on turning raw data into clean, reliable, analysis-ready systems.
I build practical data workflows with Python, Pandas, SQL, ETL, and machine learning, with a strong interest in analytics pipelines, trustworthy data models, and decision-ready reporting.
Focus
Pipeline design
Moving data from ingestion to clean, structured outputs.
Core Stack
Python, Pandas, SQL
Building reliable transformations and analytical datasets.
Goal
Internship-ready impact
Contributing to engineering teams with practical project experience.
Professional Summary
Aspiring data engineer with a strong interest in analytics infrastructure, quality-focused transformation logic, and business-facing insights.
About
My projects are centered on the full path from raw input to useful output: collecting data, cleaning inconsistencies, transforming datasets, and preparing reliable structures for reporting or machine learning.
I am especially interested in data pipelines, ETL workflows, analytical modeling, and the operational discipline required to make data products trustworthy. I like solving problems where good structure, reproducibility, and clear logic matter.
As I apply for Data Engineering Intern and Junior Data Engineer positions, I want to contribute with a strong Python and SQL foundation, curiosity for production systems, and the mindset to keep improving data quality at every stage.
Pipeline Thinking
I approach projects as connected workflows, not isolated scripts, with attention to extraction, transformation, validation, and output quality.
Analytics Foundation
I use Python, Pandas, and SQL to clean datasets, shape meaningful tables, and support exploratory analysis, dashboards, and modeling tasks.
Practical Delivery
My portfolio combines event analytics, forecasting, business systems, and public notebooks to show both implementation detail and practical problem-solving.
Skills
I focus on the tools and practices needed to collect, transform, model, and analyze data with clarity and reliability.
Strong foundations for data preparation, querying, and structured problem-solving.
Building repeatable data flows with clear transformations and analytics-ready outputs.
Applying feature engineering and model evaluation on structured, well-prepared data.
Working in a project-driven way with version control, iterative delivery, and public portfolio visibility.
Featured Projects
These projects highlight how I structure data, automate processing, and translate problem statements into useful outputs for analysis or decision-making.
Event Analytics Pipeline
Designed a practical analytics pipeline to clean raw event streams, standardize schemas, transform user activity into structured tables, and support decision-ready metrics for funnels, engagement, and retention.
Why it fits data engineering
This project reflects my ability to think beyond isolated analysis and design a repeatable flow from ingestion to reporting-ready outputs.
Gold Price Forecast
Built a forecasting workflow focused on data cleaning, feature preparation, trend analysis, and model comparison to predict gold price movement with clear evaluation logic.
Contribution
Demonstrates structured preprocessing, feature engineering, and evaluation habits that are essential before models can be trusted.
SME Management System
Developed a system-oriented project that required structured data handling, process organization, and application logic supporting day-to-day operational workflows.
Contribution
Shows my ability to work with structured business data and think carefully about data consistency, usability, and system outcomes.
Kickstarter DApp Contract
Built a smart contract project that strengthened my thinking around system rules, data integrity, and predictable behavior in environments where correctness matters.
Contribution
While outside classic analytics, it reflects careful workflow design and reliability-oriented thinking that also matters in engineering data systems.
Kaggle Projects
My Kaggle work extends the portfolio with hands-on experimentation, feature engineering, exploratory workflows, and public demonstrations of how I approach dataset preparation and modeling.
Why it matters
It shows consistent practice, visible output, and a habit of learning by doing, which is especially valuable at the internship level.
Visit Kaggle profileAchievements
Applied Portfolio
5
Key projects across event analytics, forecasting, business systems, blockchain logic, and Kaggle experimentation.
Public Presence
3
Active channels for recruiters and teams to review my GitHub, Kaggle, and LinkedIn work.
Career Focus
1
Clear direction toward Data Engineering Intern and Junior Data Engineer opportunities with practical, role-aligned projects.
Contact
If you are hiring for a Data Engineering Intern or Junior Data Engineer role, I would be excited to connect and discuss how I can contribute with strong fundamentals, curiosity, and practical project experience.
GitHub
tapheret2
Code portfolio and project repositories
Kaggle
taphere
Notebooks, experiments, and datasets
phamtienphat
Professional profile and networking
Location
Ho Chi Minh City
Vietnam, open to internship opportunities