Home

Notes

  • ⌘k

BIBF-AI


Machine Learning Notes Website

This website aims to serve as a repository of notes from AWS machine learning training conducted at BIBF (Bahrain Institute of Banking and Finance) through a partnership program with Tamkeen.

The goal is to create an open platform where trainees can contribute and access a knowledge base to supplement their learning during the AWS ML certification course.

Contents

The website will host machine learning tutorials, examples, explanations of concepts, summaries of sessions, handy tips and tricks, etc. related to the curriculum.

  • Session-wise notes from lectures
  • Cheat sheets
  • Revision notes
  • Tips for certification

Recommend Path

  1. Cloud EssentialCloud EssentialUtilizing AWS: Unveiling its Significance The instructors shed light on the essential role of Amazon Web Services (AWS) and its profound importance. They narrated compelling stories to underscore the significance of embracing cloud computing, emphasizing two key aspects: 1. Disaster Recovery The instructors highlighted how AWS serves as a robust platform for disaster recovery, enabling businesses to swiftly recover and resume operations in the face of unforeseen disruptions. 2. Temporary Job

  2. Architecting-on-AWSArchitecting-on-AWSDay 1: AWS Security and Identity Management Create Role Trust Policy: The trust policy defines who or what can assume the role. It outlines the entities that are allowed to take on the permissions associated with the role. This can include AWS services, users, accounts, or even external identities Permissions Policy: The permissions policy specifies what actions and resources the role is allowed to access or manipulate. It defines the boundaries of the role's capabilities within AWS. Permi

  3. Exploratory Data AnalysisExploratory Data AnalysisExploratory Data Analysis Python Pandas Pandas is a fast, powerful, flexible and easy to use open source data analysis and data manipulation library built on top of the Python programming language. - Data Frame: A two-dimensional data structure, i.e., data is aligned in a tabular fashion in rows and columns. - Series: A one-dimensional labeled array capable of holding data of any type (integer, string, float, python objects, etc.). The axis labels are collectively called index.

  4. Data Analytics FundamentalsData Analytics FundamentalsData Analytics Fundamentals: From Raw Data to Actionable Insights Introduction to Data Analytics Data analytics is the systematic process of examining raw data to uncover meaningful patterns, correlations, and insights. In today's data-driven world, analytics transforms information into actionable knowledge that drives business decisions, operational improvements, and strategic planning. The Evolution of Data Analytics Historical Context: Descriptive Analytics**: What happened? (Traditional

  5. Data EngineeringData EngineeringData Engineering S3 S3 Storage Classes S3 Standard**: For general-purpose storage of frequently accessed data; designed for durability of 99.999999999% of objects over a given year. S3 Intelligent-Tiering**: Designed to optimize costs by automatically moving data to the most cost-effective access tier, without performance impact or operational overhead. S3 Standard-IA**: For long-lived, infrequently accessed data that needs to be accessed rapidly when needed. S3 One Zone-IA**: For long-lived

  6. Building Data AnalyticsBuilding Data AnalyticsBuilding Batch Data Analytics Solutions on AWS In this guide, we'll explore how to build batch data analytics solutions using Amazon EMR (Elastic MapReduce). Amazon EMR is a cloud-based big data platform that allows you to process large datasets efficiently. We'll cover various modules to help you understand the key components and best practices for designing and implementing batch analytics solutions. Module 1: Introduction to Amazon EMR What is Amazon EMR? Amazon EMR is a managed big data

  7. Machine Learning Pipeline on AWSMachine Learning Pipeline on AWSBuilding End-to-End Machine Learning Pipelines on AWS Machine learning pipelines represent the systematic approach to transforming raw data into production-ready models that deliver business value. This comprehensive guide explores the complete ML pipeline lifecycle on AWS, from problem formulation to model deployment, with practical examples and best practices for building scalable, reliable ML solutions. The ML Pipeline Lifecycle: A Comprehensive Overview A machine learning pipeline is a s

  8. MLOPS EngineeringMLOPS EngineeringMLOps Engineering: Building Reliable Machine Learning Pipelines Machine Learning Operations (MLOps) represents the convergence of machine learning, DevOps, and data engineering practices designed to streamline the deployment, monitoring, and management of ML models in production. This comprehensive guide explores MLOps fundamentals, AWS implementation strategies, and best practices for building scalable, reliable ML systems. The MLOps Imperative: Why ML Models Fail in Production Traditional

  9. Amazon Sage MakerAmazon Sage Makerexcerpt: "This about all about SageMaker and it's built in Algorthims and how to use them and how to tune them." date: 2025-01-29 category: "AWS ML" Amazon SageMaker Built-in Algorithms: A Comprehensive Guide to Machine Learning at Scale Amazon SageMaker stands as AWS's premier machine learning platform, offering a fully managed environment for building, training, and deploying ML models. One of its most powerful features is the extensive collection of built-in algorithms that cover a wide spec