CSV 파일을 여러 줄 JSON으로 변환하는 방법은 무엇입니까?

programing tip

CSV 파일을 여러 줄 JSON으로 변환하는 방법은 무엇입니까?

itbloger 2020. 10. 12. 07:19

CSV 파일을 여러 줄 JSON으로 변환하는 방법은 무엇입니까?

여기 내 코드, 정말 간단한 것들이 있습니다.

import csv
import json

csvfile = open('file.csv', 'r')
jsonfile = open('file.json', 'w')

fieldnames = ("FirstName","LastName","IDNumber","Message")
reader = csv.DictReader( csvfile, fieldnames)
out = json.dumps( [ row for row in reader ] )
jsonfile.write(out)

일부 필드 이름을 선언하고 리더는 CSV를 사용하여 파일을 읽고 파일 이름을 사용하여 파일을 JSON 형식으로 덤프합니다. 여기에 문제가 있습니다 ...

CSV 파일의 각 레코드는 다른 행에 있습니다. JSON 출력이 같은 방식이기를 원합니다. 문제는 모든 것을 하나의 거대하고 긴 줄에 버리는 것입니다.

나는 for line in csvfile:다음 과 같은 것을 사용하고 reader = csv.DictReader( line, fieldnames)각 줄을 반복 하는 코드를 아래에서 실행 하려고 시도했지만 한 줄에서 전체 파일을 수행 한 다음 다른 줄에서 전체 파일을 반복합니다 ... 줄이 떨어질 때까지 계속됩니다. .

이 문제를 해결하기위한 제안 사항이 있습니까?

편집 : 명확히하기 위해 현재 : (1 행의 모든 레코드)

[{"FirstName":"John","LastName":"Doe","IDNumber":"123","Message":"None"},{"FirstName":"George","LastName":"Washington","IDNumber":"001","Message":"Something"}]

내가 찾는 것 : (2 줄에 2 개의 레코드)

{"FirstName":"John","LastName":"Doe","IDNumber":"123","Message":"None"}
{"FirstName":"George","LastName":"Washington","IDNumber":"001","Message":"Something"}

각 개별 필드가 들여 쓰기 / 별도의 줄에있는 것이 아니라 각 레코드가 자체 줄에 있습니다.

샘플 입력.

"John","Doe","001","Message1"
"George","Washington","002","Message2"

원하는 출력의 문제는 유효한 json 문서가 아니라는 것입니다. 그것은 json 문서 의 흐름입니다 !

필요하다면 괜찮습니다.하지만 출력에서 원하는 각 문서에 대해 json.dumps.

문서를 분리하려는 줄 바꿈이 해당 문서에 포함되어 있지 않으므로 직접 제공해야합니다. 따라서 json.dump에 대한 호출에서 루프를 꺼내서 작성된 각 문서에 대한 개행 문자를 삽입하면됩니다.

import csv
import json

csvfile = open('file.csv', 'r')
jsonfile = open('file.json', 'w')

fieldnames = ("FirstName","LastName","IDNumber","Message")
reader = csv.DictReader( csvfile, fieldnames)
for row in reader:
    json.dump(row, jsonfile)
    jsonfile.write('\n')

다음 예제를 통해 Pandas DataFrame을 사용하여이를 달성 할 수 있습니다.

import pandas as pd
csv_file = pd.DataFrame(pd.read_csv("path/to/file.csv", sep = ",", header = 0, index_col = False))
csv_file.to_json("/path/to/new/file.json", orient = "records", date_format = "epoch", double_precision = 10, force_ascii = True, date_unit = "ms", default_handler = None)

@SingleNegationElimination의 응답을 가져와 파이프 라인에서 사용할 수있는 세 줄로 단순화했습니다.

import csv
import json
import sys

for row in csv.DictReader(sys.stdin):
    json.dump(row, sys.stdout)
    sys.stdout.write('\n')

당신은 이것을 시도 할 수 있습니다

import csvmapper

# how does the object look
mapper = csvmapper.DictMapper([ 
  [ 
     { 'name' : 'FirstName'},
     { 'name' : 'LastName' },
     { 'name' : 'IDNumber', 'type':'int' },
     { 'name' : 'Messages' }
  ]
 ])

# parser instance
parser = csvmapper.CSVParser('sample.csv', mapper)
# conversion service
converter = csvmapper.JSONConverter(parser)

print converter.doConvert(pretty=True)

편집하다:

더 간단한 접근

import csvmapper

fields = ('FirstName', 'LastName', 'IDNumber', 'Messages')
parser = CSVParser('sample.csv', csvmapper.FieldMapper(fields))

converter = csvmapper.JSONConverter(parser)

print converter.doConvert(pretty=True)

import csv
import json

file = 'csv_file_name.csv'
json_file = 'output_file_name.json'

#Read CSV File
def read_CSV(file, json_file):
    csv_rows = []
    with open(file) as csvfile:
        reader = csv.DictReader(csvfile)
        field = reader.fieldnames
        for row in reader:
            csv_rows.extend([{field[i]:row[field[i]] for i in range(len(field))}])
        convert_write_json(csv_rows, json_file)

#Convert csv data into json
def convert_write_json(data, json_file):
    with open(json_file, "w") as f:
        f.write(json.dumps(data, sort_keys=False, indent=4, separators=(',', ': '))) #for pretty
        f.write(json.dumps(data))


read_CSV(file,json_file)

json.dumps () 문서

indent매개 변수 추가json.dumps

 data = {'this': ['has', 'some', 'things'],
         'in': {'it': 'with', 'some': 'more'}}
 print(json.dumps(data, indent=4))

또한 json.dumpopen으로 간단히 사용할 수 있습니다 jsonfile.

json.dump(data, jsonfile)

Pandas를 사용하여 csv 파일을 DataFrame ( pd.read_csv ) 으로 읽은 다음 원하는 경우 열을 조작 (삭제 또는 값 업데이트)하고 마지막으로 DataFrame을 다시 JSON ( pd.DataFrame.to_json ) 으로 변환하는 방법은 무엇입니까 ?

참고 : 이것이 얼마나 효율적인지 확인하지 않았지만 이것은 확실히 큰 csv를 json으로 조작하고 변환하는 가장 쉬운 방법 중 하나입니다.

I see this is old but I needed the code from SingleNegationElimination however I had issue with the data containing non utf-8 characters. These appeared in fields I was not overly concerned with so I chose to ignore them. However that took some effort. I am new to python so with some trial and error I got it to work. The code is a copy of SingleNegationElimination with the extra handling of utf-8. I tried to do it with https://docs.python.org/2.7/library/csv.html but in the end gave up. The below code worked.

import csv, json

csvfile = open('file.csv', 'r')
jsonfile = open('file.json', 'w')

fieldnames = ("Scope","Comment","OOS Code","In RMF","Code","Status","Name","Sub Code","CAT","LOB","Description","Owner","Manager","Platform Owner")
reader = csv.DictReader(csvfile , fieldnames)

code = ''
for row in reader:
    try:
        print('+' + row['Code'])
        for key in row:
            row[key] = row[key].decode('utf-8', 'ignore').encode('utf-8')      
        json.dump(row, jsonfile)
        jsonfile.write('\n')
    except:
        print('-' + row['Code'])
        raise

As slight improvement to @MONTYHS answer, iterating through a tup of fieldnames:

import csv
import json

csvfilename = 'filename.csv'
jsonfilename = csvfilename.split('.')[0] + '.json'
csvfile = open(csvfilename, 'r')
jsonfile = open(jsonfilename, 'w')
reader = csv.DictReader(csvfile)

fieldnames = ('FirstName', 'LastName', 'IDNumber', 'Message')

output = []

for each in reader:
  row = {}
  for field in fieldnames:
    row[field] = each[field]
output.append(row)

json.dump(output, jsonfile, indent=2, sort_keys=True)

import csv
import json
csvfile = csv.DictReader('filename.csv', 'r'))
output =[]
for each in csvfile:
    row ={}
    row['FirstName'] = each['FirstName']
    row['LastName']  = each['LastName']
    row['IDNumber']  = each ['IDNumber']
    row['Message']   = each['Message']
    output.append(row)
json.dump(output,open('filename.json','w'),indent=4,sort_keys=False)

참고URL : https://stackoverflow.com/questions/19697846/how-to-convert-csv-file-to-multiline-json

'programing tip' 카테고리의 다른 글

div의 내용 변경-jQuery (0)	2020.10.12
Windows에서 경로 길이가 260자를 초과하는 파일을 찾으려면 어떻게합니까? (0)	2020.10.12
jquery없는 scrollTop 애니메이션 (0)	2020.10.12
리치 vs 빈혈 도메인 모델 (0)	2020.10.12
JSON 요청을위한 AlamoFire 비동기 완료 핸들러 (0)	2020.10.12

현재글CSV 파일을 여러 줄 JSON으로 변환하는 방법은 무엇입니까?

itbloger

CSV 파일을 여러 줄 JSON으로 변환하는 방법은 무엇입니까?

CSV 파일을 여러 줄 JSON으로 변환하는 방법은 무엇입니까?

'programing tip' 카테고리의 다른 글

'programing tip'의 다른글

티스토리툴바

CSV 파일을 여러 줄 JSON으로 변환하는 방법은 무엇입니까?

CSV 파일을 여러 줄 JSON으로 변환하는 방법은 무엇입니까?

'programing tip' 카테고리의 다른 글

'programing tip'의 다른글

관련글

티스토리툴바