Maritime 4.0: Innovation Driven by AI, Data, and Cyber Security

Posts

(ICML 2019) EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

6/05/2021

EfficientNet is a highly influential paper that has drawn significant attention in image classification for its strong performance. For projects that would otherwise require extensive training time or computational resources, EfficientNet is a valuable approach to improving ConvNet performance. It provides an efficient, principled method for scaling convolutional neural networks that balances accuracy against computational cost, which makes it well suited to real-world AI deployment.

🔗 Research Paper: EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

📌 Key Resources & Reviews
📖 Paper Review Summaries: Bellzero’s Review, Laonple Blog Review
💻 Source Code (PyTorch Implementation): GitHub: EfficientNet-PyTorch
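As a rough illustration of the "model scaling" in the paper's title, the sketch below computes the compound-scaling multipliers: depth, width, and input resolution grow together through a single coefficient phi, using the constants the paper reports for its B0 baseline (alpha = 1.2, beta = 1.1, gamma = 1.15). The function names are hypothetical, and the snippet only computes multipliers; it does not build or train a network.

```python
# Compound scaling sketch (function names are illustrative, not from any library).
# Constants as reported in the EfficientNet paper for the B0 baseline.
ALPHA = 1.2   # depth multiplier base
BETA = 1.1    # width multiplier base
GAMMA = 1.15  # resolution multiplier base


def compound_scale(phi):
    """Return depth/width/resolution multipliers for compound coefficient phi."""
    return {
        "depth": ALPHA ** phi,
        "width": BETA ** phi,
        "resolution": GAMMA ** phi,
    }


def flops_multiplier(phi):
    """Approximate FLOPS growth: depth scales linearly, width and resolution quadratically."""
    return (ALPHA * BETA ** 2 * GAMMA ** 2) ** phi


if __name__ == "__main__":
    for phi in range(4):
        print(phi, compound_scale(phi), round(flops_multiplier(phi), 3))
```

Because alpha * beta^2 * gamma^2 is close to 2, each unit increase in phi roughly doubles the compute budget in this approximation.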

(Environment) (Cairo) – (1) Installing Required Libraries to Convert SVG Files to PNG

5/24/2021

The goal was to apply NBP OCR to SVG images and record the output (text and coordinates) as JSON. The first step was to convert each ".svg" file to ".png". I initially assumed that installing cairo via pip would be enough, but additional steps turned out to be necessary. To avoid losing time on the same issues the next time around, I documented the process and am sharing it here. If you need code references or run into setup issues, please leave a message at https://github.com/shipjobs/HAND2TEXT/issues.

Environment
Language: Python 3.8.3 (64-bit)
Operating System: Windows 10
Development Environment: Visual Studio Code

To convert an ".svg" file to ".png", importing the libraries as shown below will, on a fresh setup, fail with an import (reference) error:

import cairo
from svglib.svglib import svg2rlg
import cairosvg

For those...
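A minimal sketch of the conversion step, assuming the cairosvg package is the one used for rendering (the excerpt also imports svglib, so the original post may use a different route). The helper names `png_path_for` and `convert_svg_to_png` are hypothetical. The import is deferred and guarded because, on Windows, cairosvg can be installed via pip and still fail to load when the native Cairo library is missing.

```python
from pathlib import Path


def png_path_for(svg_path):
    """Return the output path: same location and name, with a .png suffix."""
    return Path(svg_path).with_suffix(".png")


def convert_svg_to_png(svg_path):
    """Render one SVG file to PNG next to it and return the PNG path."""
    try:
        # Imported lazily so a missing native Cairo library gives a clear message.
        import cairosvg
    except (ImportError, OSError) as exc:
        raise RuntimeError(
            "cairosvg could not be loaded; check that both the Python package "
            "and the native Cairo library are installed"
        ) from exc

    out_path = png_path_for(svg_path)
    cairosvg.svg2png(url=str(svg_path), write_to=str(out_path))
    return out_path


if __name__ == "__main__":
    # "drawing.svg" is a placeholder file name for illustration.
    print(png_path_for("drawing.svg"))
```

Calling `convert_svg_to_png("drawing.svg")` would write `drawing.png` alongside the input if the environment is set up correctly.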

(Cloud) NCP > AI Service > OCR review

5/21/2021

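CLOVA OCR (optical character reader) Service review

Recognizes documents and accurately extracts text and data from user-specified regions.

I figured that trying CLOVA OCR (optical character recognition) once would make the other AI Services offered by NCP easier to approach, so I decided to write this review.

[Access]
Products & service: https://console.ncloud.com/dashboard
OCR Service path: Classic / CLOVA OCR / Domain

[How to use]
The service is configured by choosing a service type (General / Template / Document); depending on the choice, a Text OCR, Template Builder, or Document button is shown. Text OCR (extracts text only) and the Template Builder (you specify the regions to read directly, extract the recognized values, then test and send the results) correspond to the following two modes, depending on the service type:
1. General OCR: reads all of the text present in a PNG or JPG image or a PDF, which is what we usually think of as OCR.
2. Template OCR: reads text from predefined regions within an image, such as a driver's license, a credit card, or a resident registration certificate.
The Document mode uses a specialized machine-learning model engine that understands the semantic structure of a document and automatically extracts the input information as key-value pairs. The predefined recognition models offered for selection are business registration certificate, credit card, receipt, ID card, and business card.

[There are also two ways to use each of these service types.]
1. Through the UI provided on the NCP OCR site: register the desired image file by drag and drop, and ...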

(CVPR 2019) A Style-Based Generator Architecture for Generative Adversarial Networks

5/13/2021

StyleGAN: Official TensorFlow Implementation

Material related to our paper is available via the following links:
Paper: https://arxiv.org/abs/1812.04948
Video: https://youtu.be/kSLJriaOumA
Code: https://github.com/NVlabs/stylegan
FFHQ: https://github.com/NVlabs/ffhq-dataset

Additional material can be found on Google Drive:

(NeurIPS 2020) Dynamic allocation of limited memory resources in reinforcement learning

5/06/2021

Dynamic allocation of limited memory resources in reinforcement learning

(CVPR 2020, Best Paper Award) Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild

4/24/2021

Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild
Shangzhe Wu, Christian Rupprecht, Andrea Vedaldi
Visual Geometry Group, University of Oxford
{szwu, chrisr, vedaldi}@robots.ox.ac.uk

(Research) AI Blink Detection and Reminder (1) – Project Introduction

4/14/2021

[Open Source + Research Paper + Jetson Nano] Blinker Project

This project aims to implement a blink detection system using dlib and OpenCV, following an existing open-source implementation. It builds on facial recognition techniques to detect eye blinks in real time. According to the contributor's description, the system can be applied in scenarios such as:
🚗 Drowsy driving detection – alerting drivers when signs of fatigue are detected.
📚 Student monitoring – analyzing drowsiness and focus levels in study environments.

By running on Jetson Nano, the project explores edge AI for real-time blink detection, opening up applications in safety, education, and human-computer interaction.

🚀 Easily Implementable with a Camera and Software Development Setup
If you have a camera and a properly configured software development environment, this project is relatively easy to implement. In this project, we...
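The excerpt does not say which blink metric the project uses. One common choice with dlib facial landmarks is the eye aspect ratio (EAR), so the sketch below assumes that approach purely for illustration. The function names, the 0.2 threshold, and the two-frame minimum are assumptions, and the landmark coordinates are passed in as plain tuples rather than read from a camera.

```python
import math


def eye_aspect_ratio(eye):
    """Compute EAR from six (x, y) eye landmarks ordered p1..p6.

    p1 and p4 are the horizontal eye corners; (p2, p6) and (p3, p5)
    are the two vertical pairs on the upper and lower eyelid.
    """
    p1, p2, p3, p4, p5, p6 = eye
    vertical = math.dist(p2, p6) + math.dist(p3, p5)
    horizontal = math.dist(p1, p4)
    return vertical / (2.0 * horizontal)


def count_blinks(ear_series, threshold=0.2, min_frames=2):
    """Count blinks: runs of at least min_frames frames with EAR below threshold,
    followed by the eye reopening."""
    blinks = 0
    closed_frames = 0
    for ear in ear_series:
        if ear < threshold:
            closed_frames += 1
        else:
            if closed_frames >= min_frames:
                blinks += 1
            closed_frames = 0
    return blinks


if __name__ == "__main__":
    open_eye = [(0, 0), (1, 1), (2, 1), (3, 0), (2, -1), (1, -1)]
    print(eye_aspect_ratio(open_eye))
    print(count_blinks([0.3, 0.3, 0.1, 0.1, 0.1, 0.3, 0.3, 0.1, 0.3]))
```

In a real pipeline the six points per eye would come from a landmark detector on each video frame, and the threshold would need tuning per camera and subject.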

(Research) Face Recognition (1) – Project Introduction

4/12/2021

Let's explore the Face Recognition Project! 😊 In this project, we will maximize the use of open-source resources with the following objectives:
1️⃣ Understand key libraries and source code
2️⃣ Set up the required development environment
3️⃣ Optimize performance for better accuracy and efficiency
4️⃣ Share results and discuss potential improvements
The goal is to go beyond simple implementation, actively improving the system and exploring ways to enhance its capabilities. 🚀

(ISSN 2249-3905) Natural Language Processing: A Review

4/10/2021

ABSTRACT
1. Introduction
2. Scope and objective
3. Previous Works On NLP (Brief History)
4. Natural Language Processing Overview
5. Applications of NLP
6. Challenges and failures
7. Current and Future progress of NLP
8. Conclusions
References

THE BEST ARTIFICIAL INTELLIGENCE JOURNALS

4/07/2021

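Hello! I would like to put together some information that may help people who are pursuing research or a degree, as well as those working at a company who want to start publishing papers. I became curious about which conferences bring together AI researchers and engineers, and about the history, characteristics, and trends of each one. Individual papers are easy to find through a Google search or "http://www.arxiv-sanity.com/", but I believe that knowing the characteristics and trends of each conference will help our research activities, including getting papers published, so I plan to organize that information here and keep sharing the process.

1. NeurIPS (NIPS): the largest annual conference bringing together researchers and engineers, a venue for sharing new findings, collaborating, and advancing the AI industry together. History: since 1987. Fields: broad, including cognitive science and applied machine learning. Link: https://papers.nips.cc/
2. ICML (International Conference on Machine Learning): focused on machine learning; together with NeurIPS and ICLR, one of the three major conferences with a large influence on machine learning and artificial intelligence research. Exact dates vary from year to year, but the paper submission deadline is generally at the end of January, and the conference is generally...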