Maritime 4.0: Innovation Driven by AI, Data, and Cyber Security

Posts

Showing posts from May, 2021

(Environment) (Cairo) – (1) Installing Required Libraries to Convert SVG Files to PNG

5/24/2021

Installing Required Libraries to Convert SVG Files to PNG The goal was to apply NBP OCR to SVG images and record the output as JSON (text, coordinates) . To achieve this, the first step was to convert the ".svg" file into ".png" . Initially, I thought that simply installing cairo via pip would suffice, but I soon realized that additional steps were necessary . To avoid wasting time solving issues when I attempted it again later, I decided to document the process and share it . If you need code references or encounter any setup issues, please feel free to leave a message here: https://github.com/shipjobs/HAND2TEXT/issues . Environment Language: Python 3.8.3 64bit Operating System: Windows 10 Development Environment: Visual Studio Code To convert an ".svg" file to ".png" , if you import the libraries as shown below, you will naturally encounter a reference error : import cairo from svglib.svglib import svg2rlg import cairosvg For those...

(Cloud) NCP > AI Service > OCR review

5/21/2021

CLOVA OCR (optical character reader) Service review 문서를 인식하고, 사용자가 지정한 영역의 텍스트와 데이터를 정확하게 추출 CLOVA OCR (광학문자인식) 을 한번 사용해 본다면, NCP가 제공 중에 있는 AI Service 들 에 대하여 보다 쉬운 접근이 가능해질 거라 생각 하게 되어 review 를 해보고자 합니다. [접근] Products & service : https://console.ncloud.com/dashboard OCR Service 경로 : Classic / CLOVA OCR / Domain [이용 방식] 서비스 타입 General / Template / Document 선택에 따라 Text OCR / 템플릿 빌더 / Document 버튼이 노출되는 형식으로 서비스를 설정함 Text OCR (텍스트만 추출) 과 Template 빌더 형태 (판독 영역 직접 지정을 통해 인식 값 추출 후 테스트 및 결과 전송이 가능) 는 서비스 타입에 따라 아래의 2가지 방식이 있으며 1. General OCR : 우리가 일반적으로 생각하는 png, jpg이미지 혹은 pdf 에 존재하는 text 들을 모두 읽어 오고자 하는 방식 2. Template OCR : 운전 면허증, 신용 카드, 주민 등록 등본 이미지 등 이미지내 정해진 특정 영역을 기준으로 text 들을 읽어 오는 방식 Document 방식은 머신러닝 기반으로 문서의 의미적 구조를 이해하는 특화 모델 엔진을 탑재하여 입력 정보(key-value)를 자동 추출하는 방식 인식 모델로는 미리 정해진 사업자 등록증, 신용카드,영수증, 신분증, 명함이 제공 되며 이를 선택할 수 있게 되어 있습니다. [이러한 Type 별 서비스를 이용하는 방법 또한 2가지로 구분 될 수 있습니다.] 1. NCP OCR 사이트에 접속해서 제공되는 UI 화면으로 접근 하는 방식으로 원하는 이미지 파일을 drag and drop으로 등록하고 텍...

(CVPR 2019) A Style-Based Generator Architecture for Generative Adversarial Networks

5/13/2021

StyleGAN — Official TensorFlow Implementation Material related to our paper is available via the following links: Paper: https://arxiv.org/abs/1812.04948 Video: https://youtu.be/kSLJriaOumA Code: https://github.com/NVlabs/stylegan FFHQ: https://github.com/NVlabs/ffhq-dataset Additional material can be found on Google Drive:

(NeurIPS 2020) Dynamic allocation of limited memory resources in reinforcement learning

5/06/2021

Dynamic allocation of limited memory resources in reinforcement learning