DataOps ãšã¯ïŒ
- ããŒã¿ã®åéã»å å·¥ã»é ä¿¡ãã€ãã©ã€ã³ãã³ãŒãã§ç®¡çããCI/CDã§èªååãã
- ããŒã¿å質ãã¹ãïŒã¹ããŒããã§ãã¯ã»ç°åžžå€æ€ç¥ãªã©ïŒãçµã¿èŸŒãã§åè³ªãæ ä¿ãã
- DevOpsã®æåãããŒã¿ããŒã ã«é©çšããéçºã»åæã»éçšã®é£æºã匷åãã
- Apache Airflowã»dbtã»Great Expectationsãªã©ã代衚çãªDataOpsããŒã«
DataOpsã£ãŠDevOpsãšäœãéãã®ïŒ
DevOpsã¯ã¢ããªã±ãŒã·ã§ã³ã®ã³ãŒãããã«ãã»ãã¹ãã»ãããã€ããèªååã ãã©ãDataOpsã¯ãããŒã¿ãã®ãã®ãã察象ã«ãããã ãããŒã¿ã®åéã»å€æã»å質ãã§ãã¯ã»é ä¿¡ããã€ãã©ã€ã³ãšããŠèªååããŠãåæè ãMLãšã³ãžãã¢ã«ä¿¡é Œã§ããããŒã¿ãçŽ æ©ãå±ããã®ãç®çã ãã
ããŒã¿ã®å質ãã§ãã¯ã£ãŠã©ãããã®ïŒ
ããšãã°Great ExpectationsãšããããŒã«ã䜿ããšããã®ã«ã©ã ã«NULLãå ¥ã£ãŠããªããããå€ãæ³å®ç¯å²å ãããã¬ã³ãŒãæ°ãæ¥ã«å¢æžããŠããªããããšãã£ããã¹ããèªåã§å®è¡ã§ããããããŒã¿ã«ããŠããããã¹ããæžãæèŠã ãã
ãã€ãã©ã€ã³ã£ãŠå ·äœçã«äœãããã®ïŒ
ããšãã°ãæ¯æ¥æ·±å€ã«APIããããŒã¿ãååŸâäžèŠãªã«ã©ã ãåé€ã»åã倿âããŒã¿ãŠã§ã¢ããŠã¹ã«ããŒãâBIããã·ã¥ããŒããæŽæ°ããšããäžé£ã®åŠçããããAirflowãªã©ã®ããŒã«ã§DAGïŒæåéå·¡åã°ã©ãïŒãšããŠå®çŸ©ããŠãã¹ã±ãžã¥ãŒã«å®è¡ã»ãšã©ãŒéç¥ãèªååãããã ã
DataOpsãå°å ¥ãããšã©ããªããããšãããã®ïŒ
ãæšæ¥ã®ã¬ããŒãã®ããŒã¿ããªãããããããã§ããã©ãã¿ãããªåãåãããæ¿æžãããããŒã¿ã®ä¿¡é Œæ§ãäžãã£ãŠåæè ãå®å¿ããŠããŒã¿ã䜿ãããããã€ãã©ã€ã³ã®ä¿®æ£ãCI/CDã§å®å šã«ãªãªãŒã¹ã§ãããããŒã¿å質ã®åé¡ã«æ°ã¥ãã®ãããŠãŒã¶ãŒããã®å ±åãã§ã¯ãªããèªåãã¹ãã®ã¢ã©ãŒããã«ãªãã®ã倧ãããã
MLOpsãšã®é¢ä¿ã¯ã©ããªã£ãŠãã®ïŒ
DataOpsã¯MLOpsã®åå°ãšãèšããããæ©æ¢°åŠç¿ã¢ãã«ã¯ããŒã¿ãåœã ãããDataOpsã§ããŒã¿ã®å質ãšäŸçµŠãå®å®ãããŠããMLOpsã§ã¢ãã«ã®éçšãåãããšããé¢ä¿ã ããDataOpsãå£ãããšMLOpsãé£éçã«å£ãããããäž¡æ¹ã»ããã§èããã®ãçæ³ã ãã