From 37-Year Marine Engineer to AI Model Evaluation Expert: My Career Transformation Journey

#AI Transformation #Career Change #LLM Evaluation #Red Teaming #Engineering Thinking

Introduction: Why Transform at 59?

My name is Ma Weize. I'm 59 years old and a professor-level senior marine engineer. From my university graduation in 1988 to 2024, I've worked for 37 years in ship design, high-speed craft development, and propeller design. At this life stage, many would choose to enjoy the stability before retirement, but I made a decision that surprised everyone around me: a complete transformation into AI model evaluation and safety alignment.

This wasn't an impulse. Over the past year, I've intensively used large language models like Claude, Deepseek, and ChatGPT to assist my work, personally experiencing how AI changes engineering design workflows. More importantly, I discovered a unique opportunity: my 37 years of engineering experience is exactly what AI model evaluation needs most.

I. Discovering a New Direction: From User to Evaluator

1. Insights from a Heavy AI User

In early 2024, I started using Claude Code to help develop ShipTechAI—a ship propeller design tool based on the Rolla SPP database. Through this process, I gained deep comparative knowledge of different AI models:

Through daily use, I began pondering: How reliable are these models in industrial scenarios? How much human verification do their outputs require? How can we systematically evaluate their performance?

💡 Turning Point

When I used Claude to process ship resistance calculations, I discovered it had omitted a crucial coefficient in the formula. This error would be hard for a layperson to detect but could be fatal for an engineer. This made me realize: AI model evaluation needs domain experts, not just AI experts.

2. The Unique Value of an Engineer's Perspective

After deeply exploring LLM model evaluation and Red Teaming, I found my background offers unique advantages:

  1. Real Scenario Understanding: I know how engineers actually use AI in their work and which scenarios are most error-prone
  2. Professional Knowledge Depth: Ability to design complex domain-specific test cases, not just simple generic questions
  3. Systematic Thinking: 37 years of engineering project management experience makes me adept at establishing evaluation frameworks and methodologies
  4. Risk Awareness: The engineering field's strict requirements for safety and reliability align perfectly with AI safety alignment goals

II. Transformation Path: How Engineering Thinking Accelerates AI Learning

1. Phase One: Learning Python from Scratch (10 Days)

Many think learning programming at 59 is impossible. But I proved with 10 days—from zero foundation to independently developing the ShipTechAI full-stack application—that engineers' learning capability is powerful.

"I didn't follow traditional tutorials to learn Python. I started directly with projects. When I didn't know something, I asked Claude, requiring it to explain the principles for each feature. This 'learn-by-doing' approach let me master content that typically takes months in just 10 days."

Key Methods:

2. Phase Two: Deep Dive into LLM Principles and Evaluation Methods (1 Month)

After mastering Python, I systematically studied LLM working principles, prompt engineering, and model evaluation methodologies. In this phase, my engineering background proved valuable again:

3. Phase Three: Practical Project Accumulation (Ongoing)

I accumulated evaluation experience through real projects:

  1. ShipTechAI Development: Deep comparison of Deepseek and Claude's performance in engineering applications
  2. Professional Domain Testing: Designing a marine engineering-specific model evaluation case library
  3. Industrial AI Safety Research: Studying LLM reliability in high-risk scenarios like industrial control and design calculations
  4. Blog Sharing: Documenting the transformation process to help other traditional engineers understand AI

III. Key Insights: Engineers' Unique Advantages in AI Evaluation

1. Real-Scenario Evaluation Cases

Most AI evaluations are based on generic tasks (translation, Q&A, code generation). But industrial AI's real challenge lies in complex scenarios in professional domains. For example:

🔬 Example: Propeller Design Evaluation Case

Question: "Design a surface piercing propeller for a 25-knot high-speed craft, 15 meters long, 8-ton displacement, with twin engines of 300 horsepower each."

Evaluation Dimensions:

  • ✅ Does it choose the correct design method (B-Troost chart vs Rolla SPP database)?
  • ✅ Are the calculation formulas accurate (resistance estimation, power distribution, RPM range)?
  • ✅ Are parameter ranges reasonable (diameter, pitch ratio, disk area ratio)?
  • ✅ Does it consider practical constraints (draft limitations, cavitation risk, vibration issues)?
  • ✅ Is the recommendation implementable (material selection, machining difficulty, cost estimation)?

Result: Through such professional cases, we can precisely evaluate models' reliability in engineering domains, which generic evaluation cannot cover.

2. Systematic Evaluation Framework

Systems engineering methods from project management apply directly to model evaluation:

3. Industrial AI Safety Practical Experience

Marine engineering is a high-safety-requirement field, giving me natural sensitivity to AI safety:

IV. Practical Results: The ShipTechAI Project

1. Project Overview

ShipTechAI is my first complete project after transitioning to AI. It's a ship propeller design tool based on the Rolla SPP database. This project not only validated my technical capabilities but also became an experimental platform for studying AI model performance in engineering applications.

2. Technical Implementation

3. Deep Application of AI-Assisted Development

During ShipTechAI development, I deeply compared different AI models' performance:

📊 Real Model Comparison Data

Deepseek:

  • ✅ 3D interpolation algorithm implementation: Excellent (concise code, high performance)
  • ✅ Complex mathematical formula conversion: Excellent (accurate understanding of engineering formulas)
  • ⚠️ UI/UX design suggestions: Average (lacks aesthetic judgment)

Claude:

  • ✅ Code architecture design: Excellent (modular, highly maintainable)
  • ✅ Problem diagnosis and debugging: Excellent (quick bug location)
  • ⚠️ Professional domain knowledge: Medium (needs detailed background from me)

ChatGPT:

  • ✅ General programming questions: Good (solid fundamentals)
  • ⚠️ Complex engineering algorithms: Requires multiple rounds of correction
  • ⚠️ Code consistency: Prone to contradictions across multiple conversations

V. Future Direction: Industrial AI Evaluation and Safety Alignment

1. Short-term Goals (6 Months)

2. Medium-term Goals (1-2 Years)

3. Long-term Vision

Promote a new "Engineer + AI Evaluator" career development path. Traditional engineers possess rich professional knowledge and practical experience, which is exactly what AI model evaluation needs most. Through my transformation journey, I hope to prove:

VI. Advice for Traditional Engineers Considering Transformation

1. Mental Preparation

2. Learning Path

  1. Step One: Become a heavy AI user (3 months)
    • Use Claude/Deepseek daily to solve real work problems
    • Record which scenarios AI performs well in and which it doesn't
    • Learn prompt engineering to improve AI usage efficiency
  2. Step Two: Learn Python basics (1 month)
    • Don't read tutorials; start projects directly with AI assistance
    • Choose automation tool development related to your job
    • Require AI to explain principles for each feature
  3. Step Three: Deep dive into AI evaluation (ongoing)
    • Systematically learn LLM principles and evaluation methods
    • Design evaluation cases for your professional domain
    • Compare different models' performance and document results
    • Participate in open-source AI evaluation projects

3. Pitfalls to Avoid

Conclusion: Lifelong Learning, Never Stop

My 37-year engineering career taught me one thing: technology changes, but problem-solving thinking methods remain constant. From ship design to AI model evaluation, they appear to be completely different fields, but the essence is the same: how to establish systematic methodologies, how to find optimal solutions under complex constraints, how to ensure system safety and reliability.

Transforming at 59 isn't an endpoint but a new starting point. The AI era offers traditional engineers unprecedented opportunities: our professional knowledge, systematic thinking, and engineering experience are exactly what AI model evaluation and safety alignment need most. Age isn't a barrier; lifelong learning is the only constant in career development.

"When I learned Python in 10 days, developed ShipTechAI in 1 month, and became a practitioner in AI evaluation in 3 months, I realized: learning ability doesn't decline with age, as long as you maintain curiosity and drive to act."

If you're also a traditional engineer thinking about how to face the AI era's challenges, I want to tell you: now is the best time. AI isn't here to replace you; it's a tool that makes your experience more valuable.

About the Author:
Ma Weize, professor-level senior marine engineer with 37 years of experience in ship design and high-speed craft development. Currently focused on LLM model evaluation, AI safety alignment, and industrial AI application research. Creator of the ShipTechAI tool.

📧 Contact: jason15994264083@gmail.com
🌐 Website: www.shiptechai.com
🚁 ShipTechAI Tool: Try Online

← Back to Blog