Overview# I confirmed the workflow for processing Dataverse data with Archivematica, so here are my notes.
Background# Archivematica provides a feature to input data from Dataverse.
https://www.archivematica.org/en/docs/archivematica-1.17/user-manual/transfer/dataverse/
I learned about this feature at the following lecture, so I decided to try it out.
https://www.kulib.kyoto-u.ac.jp/bulletin/1402322
Dataverse# I used the Demo Dataverse that was also used in the following article.
I uploaded the following data.
https://demo.dataverse.org/dataset.xhtml?persistentId=doi:10.70122/FK2/IHQZL3
From here, download both the image data itself and the JSON data. Go to the Metadata tab and select JSON from Export Metadata.
Below is a part of the JSON file. metadataBlocks contains the metadata and files contains the image file information.
{ } " } " ] m , f e " } i { } t c l a i " " " ] e " " " " " } d t d n f s l r v d d a a i a i { } { } { } { } { } { } { } " a e e a a " " " " " " " " " " " } " " " " t t s m e , , , , , , : b s r t t i p p f c f f s r m c , t c p f a i p e l " " " " " " " " ] " " " " ] " " " " ] " " " " ] " " " " " " " " e t s a a d e i i o r i t o d h " " a r u i B o l " d t m t v t m t v t m t v t m t v t m t v t m t v t m t v [ l r i s F " r d l n i l o o 5 e t v b e b l l n a : s y u y a y u y a { } y u y a { } y u y a { } y u y a " y u y a y u y a " i o e i : s U e t e e r t " c y a u a l e o " y " p l p l p l p l p l p l p l p l p l p l A p l p l p l p l : c n t l i R n e n s a D : k p l l t i A c : N " : e t e u e t e u " } " } e t e u " } " } e t e u " } e t e u r e t e u e t e u t " V e 2 s L a n d i g a s e u a i c c k a c N i C e N i C e a , a N i C e d , d N i C e d N i C e t N i C e N i C e " e : e " 5 t " m t l z e t " u " e r o a c s { m i [ a p l " a p l " u " " " " u " " " " " } a p l " a " " " " a " " " " a p l " s " " " " a p l " s a p l " a p l " n d r : 1 e : e T y e I a 7 m : " D n t e " e t m l a : m l a : t t m t v t t m t v e m l a : t t m t v t t m t v m l a : D t m t v m l a : m l a : m l a : a " 1 s 4 n " y T " d F 2 " : a D i s : " a e e s e e s h y u y a h y u y a x " " " e e s a y u y a a y u y a e e s e y u y a e e s a e e s e e s k : , i { 7 t " : p y : e i f : " t a o s : t " " s " " " s [ o p l p l o p l p l p s t @ " " s [ s p l p l s p l p l " " s [ s p l p l " " s [ n " " s " " " s " a o 2 I h e p n l 0 M " a t n R { i : : " n : : " r e t e u r e t e u a c e t : : " e e t e u e e t e u : : " c e t e u : : " d : : " N : : " 2 m f n 4 d t " " e 5 t e 8 { D 7 " e D e " o : a : N N i C e A N i C e n h r y : t N i C e t N i C e : r N i C e : : a : 0 u a I , " t n : " 3 i I a 5 2 : " a q C n " f k " t a a p l " f a p l " d e m p " t C a p l " C a p l " " t i a p l " " t H " f k " f 2 r l d : p a : 6 f d 8 " f : t u i " t a " a a r " m m l a : f m l a : e m N e d r " o m l a : o m l a : d r " p m l a : s r " u d a " a d a " 5 a s " s k " 5 i " b , 0 f e e t , i l p m u u c e e e s i e e s d e a " a u c n e e s n e e s s u c t e e s u u c m e l p m a l p - 1 e : " : a i " 6 e : 0 8 a " " s a t s r u t e o " " " s " l " " s " v " m : t e o t " " s " t " " s " D e o i " " s " b e o a p s r u t s r 0 9 , d m m J , r 7 a l 2 : t t l e i r h , m : : : " N i : : " h a : e a , m a : : " N a : : " n e , m o : : " M j , n n o e i r e e i 1 6 2 / u a P " - b 8 s 0 " i e , m a o p : a a : t l " " s p c : a c : a s p n : y e t i s , m a O , m - . 8 i d r g E : 1 a b e 2 " : o " i 1 r o { " f k t " f t u " : h e o t " f k t " f . c o V " f c r t i i , f i 1 j 1 : o a e G , c 0 , 5 2 n , t 9 " u a a " a i a a " p e h t t u N d a " a E d a " k r u a d a " F t o i t t D t 9 p 0 1 i 1 / " b 7 - 0 t i 6 , n u l p m o u l p s " t " t C n a a l p m m a l p a i n l s l p i " l e o i S e i " g 9 0 . 9 j I s e b 0 2 r M v " d t s r u n t s r : : t T p o d m t s r u a t s r m p d u D s r r , l s r v a p v " 3 . o 6 p m 3 3 a 1 5 u e e " h e i r " h e i / p h s n " e a e i r i a e i u t " e e e i s e " " e t o e , , 7 r . e a : b c - - e t " , o , m a : o , m / { : e : t , " s , m a l s , m r i , " s , m t d , " o s " 0 g j g g / 5 b 1 0 a , r i , r i r / / a : e i , " e i a o : c i V , r i , 1 / p " e / c e 9 1 d N t { A t o / U / c t t : t t . n r t D o u t 2 1 g , " d f 3 " - a a i S f i r w n s t { C i S C i 1 " { i i a c " " 2 0 " , e 0 b , 1 t m v a f v . w i c " o v a { o v 2 , p v t a , / . , m 2 5 9 a e e t i e o w v h , n e t n e 6 t e a b F 7 o 1 c " " " " o l " r . e e t " o t " 3 i " s u K 0 - 9 f , , , , r i , g g r m a , r a , @ o , e l 2 1 d 1 0 u a / r s a c u c g n t a / 2 a 0 2 " t 0 i i . t " t m V " r I 2 t f 1 i 5 d t o N E a a y H / a d 9 o 7 . y r a m i l " Q F v 2 1 n z a g m a l u , Z K e 6 0 " h c / e i . e L 2 r d f , 3 f O " l c " 3 s c d y o r , " o , I e " 2 9 n T g , m B H - , 6 6 t o a " 7 Q o d " o k n J Z r c , l y i V L g " o o z Q 3 : g " a S / 1 y , t " B 9 / i , 7 4 " o J 8 , n V 1 " Q 5 S 4 " 8 , 2 0 d - 6 3 7 3 3 5 3 3 e a 7 c " , Data Preparation# Dataverse sample data is stored at the following location.
https://github.com/artefactual/archivematica-sampledata/tree/master/SampleTransfers/Dataverse
Let’s store the JSON file downloaded from Dataverse as dataset.json in the metadata folder. Specifically, it looks like the following.
Here, referencing the following article, I prepared data in the mdx.jp object storage connected from GakuNin RDM, and processed it from Archivematica connected to the same object storage.
Processing in Archivematica# Set the Transfer type to “Dataverse”, select the folder created earlier, and start processing.
As a result, a METS file was created as follows. Whether it’s a problem with the data registration method or a bug is unclear, but dmdSec_1 was created twice. However, the contents of dataset.json were described in DDI format.
< m e < < < < < m t m m / m / m / m e s e e < m e < m e < m e < m t : t t m / e t m / e t m e t m e s m s s e < m t s e < m t s e t s e t : e : : t m / e s : t m / e s : t s : t s m t m d s e < m t : d s e < m t : d s : a s : e s e m : t d / e s d m : t d / e s d m : d m : a t t d m s d < d t : m d m s d < d t : m d m m d t m s x s S d : i d / d s m d S d : i d / d s m d S d d S e d > m H e W x : d < < d i : d S e W x : d < < d i : d S e R S e c S l d c r m c i d / d / d : x W e c r m c i d / d / d : x W e c e e c h e n r a l o : d < < < < d d < d i c m r c a l o : d < < < < d d < d i c m r c f c M c s I p D d s i d / d / d / d / d i d / d : o l a > I p D d s i d / d / d / d / d i d / d : o l a > I > I D > : C D a e t : d < < d d < d d < d d < d i : d < d i s d D p D a e t : d < < d d < d d < d d < d i : d < d i s d D p D L D x R = M t b d c i d d d i d d i d d i d d : d i d d : t e a > = M t b d c i d d d i d d i d d i d d : d i d d : t e a > = A = I s E " D a o y i : d d i : d i : d i : d i c a : d i d d b t " D a o y i : d d i : d i : d i : d i c a : d i d d b t " B " D i A d T > o D t t i i : r i : d i : v i : i t u i : a y o a d T > o D t t i i : r i : d i : v i : i t u i : a y o a d E a = = T m Y k s a i : : t s : r i : d e : v t a s : u t D o > m Y k s a i : : t s : r i : d e : v t a s : u t D o > m L m " " E d P c t t t I i p A s s d i r v e a A e r s a s k d P c t t t I i p A s s d i r v e a A e r s a s k d = d t h D S E x r i l i D t S u p t i s S e r t c S e e A c > S E x r i l i D t S u p t i s S e r t c S e e A c > S " S e t A e = m > o S t N l t t S S s t t r S i c t s S c r e = m > o S t N l t t S S s t t r S i c t s S c r e d e c t T c " l n t l o S m h t t t S m s t o s m t t c > c " l n t l o S m h t t t S m s t o s m t t c > c a c h p E _ D n > m > t t E m m r t t i m n > t r m s _ D n > m > t t E m m r t t i m n > t r m s _ t _ M : = 1 D s t n a m > n t t b m > o t > > c t > 1 D s t n a m > n t t b m > o t > > c t > 2 a 1 D / " " I : > a g t t > > t t n > t > " I : > a g t t > > t t n > t > " s " _ / 2 " d k e > y r > n " d k e > y r > n e > 1 w 0 C > d a n > d C > d a n > d C t " w 2 R i m c a D a R i m c a D a R . > w 5 E = u y f e t E = u y f e t E j . . - A " r = f m e A " r = f m e A s . w 0 T h a " i o = T h a " i o = T o . 3 1 E t 1 d l " E t 1 d l " E n < . - D t 9 o i D 2 D t 9 o i D 2 D " / o 2 = p 6 i a a 0 = p 6 i a a 0 = m r 1 " : < " t t 2 " : < " t t 2 " x e g T 2 / / > i a 5 2 / / > i a 5 2 l t / 0 0 / d h o v - 0 / d h o v - 0 i s 2 7 2 w d t n e 0 2 w d t n e 0 2 n : 0 : 5 w i t = r 1 5 w i t = r 1 5 k t 0 2 - w : p " s - - w : p " s - - : e 1 8 0 . t s h e 2 0 . t s h e 2 0 h c / : 1 i i : t < 0 1 i i : t < 0 1 r h X 1 - c t / t / T - c t / t / T - e M M 3 2 p l / p d 0 2 p l / p d 0 2 f D L " 1 s > d s d 1 1 s > d s d 1 1 = > S T r o : i : T r o : i : T " c 0 . i / : 3 0 . i / : 3 0 m h 7 u . / d 0 7 u . / d 0 7 e e : m o r i : : m o r i : : t m 2 i r o s 2 2 i r o s 2 2 a a 7 c g r t 3 7 c g r t 3 7 d - : h / . r Z : h / . r Z : a i 5 . 1 o b " 5 . 1 o b " 5 t n 7 e 0 r t 7 e 0 r t 7 a s " d . g r t " d . g r t " / t u 7 / > y u 7 / > y d a S / 0 0 p S / 0 0 p S a n T D 1 5 e T D 1 5 e T t c A D 2 7 = A D 2 7 = A a e T I 2 z " T I 2 z " T s " U " / h R U " / h R U e S F 3 E S F 3 E S t x = v K y L = v K y L = . m " e 2 9 E " e 2 9 E " j l o r / 6 A o r / 6 A o s n r s I " S r s I " S r o s i i H > E i i H > E i n : g o Q N D g o Q N D g " x i n Z a " i n Z a " i l n = L k > n = L k > n M i a " 3 a 1 a " 3 a 1 a D n l 2 < m . l 2 < m . l T k " . / u 0 " . / u 0 " Y = > 5 d r < > 5 d r < > P " " d a / " d a / E h i , d i , d = t x : d x : d " t s I S i s I S i O p i D a : i D a : T : : N t v : N t v H / s o o e s o o e E / c > r r c > r r R w h u s h u s " w e < i e < i w m / o m / o L . a d n a d n O w L d > L d > C 3 o i o i T . c : c : Y o a A a A P r t u t u E g i t i t = / o h o h " 1 n E n E O 9 = n = n T 9 " t " t H 9 h y h y E / t > t > R x t t " l p p i : : O n / / T k / / H " w w E w w R x w w L m . . O l d d C n d d T s i i Y : : : P m c c E e o o = t d d " s e e S = b b Y " o o S h o o T t k k E t : : M p 2 2 " : _ _ / 5 5 / w h h w t t w t t . p p l : : o / / c / / . w w g w w o w w v . . / d d M d d E i i T a a S l l / l l " i i a a x n n s c c i e e : . . s o o c r r h g g e / / m S S a p p L e e o c c c i i a f f t i i i c c o a a n t t = i i " o o h n n t / / t D D p D D : I I / - - / C C w o o w d d w e e . b b l o o o o o c k k . / / g 2 2 o . . v 5 5 / / / M X X E M M T L L S S S / c c h h h e e t m m t a a p / / : c c / o o / d d w e e w b b w o o . o o l k k o . . c x x . s s g d d o " " v > > / s t a n d a r d s / m e t s / v e r s i o n 1 1 2 1 / m e t s . x s d " > Summary# I found this to be a very useful feature when considering long-term preservation of research data. I hope this serves as a helpful reference for connecting Dataverse and Archivematica.