Read all the txt files in a folder and then work with them

Question

Read all the txt files in a folder and then work with them

Navigation

#1 by (1 votes)
#2 by (0 votes)

3

What I'm working on now is a thread that has to constantly look into a "My Documents" folder and see if it finds files .txt . If you find them, read them, validate the data and send them to the database.

The issue is that I had already done the thread looking for a particular file and I was doing wonders, but then the professor corrected me that the program has to look for any file .txt that be in the folder.

I'm trying to unite what I already had done with new code that I look for files .txt and then with for take each one, the problem is within that for , I sent everything I had before in there, but it's not working for me. Well, my code so far is like this:

public void run() {

    while (true) {
        try {
            Thread.sleep(20000);

            boolean flag = false;

            try {

                FilenameFilter filter = new FilenameFilter() {
                    public boolean accept(File dir, String fileName) {
                        return fileName.endsWith("txt");
                    }
                };

                File f = new File("D:\Mis Documentos\");
                String[] fileList = f.list(filter);
                for (int i = 0; i < fileList.length; i++) {

                     System.out.println(fileList[i]);
                    FileReader fr = new FileReader(fileList[i]);
                    //FileReader fr = new FileReader("D:\Mis Documentos\Contactos.txt");
                    BufferedReader b = new BufferedReader(fr);
                    String cadena = b.readLine();
                    Validaciones v = new Validaciones();

                    if (cadena == null) {

                        JOptionPane.showMessageDialog(null, "El fichero está vacío");

                    } else {

                        while (cadena != null) {

                            String[] fields = cadena.split(SEPARATOR);

                            EntidadDAO ed = new EntidadDAODB();
                            Entidad ent = new Contacto();

                            if (fields.length == 7) {

                                // System.out.println("fields es 7");
                                for (int i2 = 0; i2 < fields.length; i2++) {

                                    nombre = fields[0];
                                    apellido = fields[1];
                                    alias = fields[2];
                                    direccion = fields[3];
                                    telefono = fields[4];
                                    nacimiento = fields[5];
                                    email = fields[6];

                                }

                                if (v.validarThread(nacimiento, email, telefono) == true) {

                                    System.out.println("El contacto " + nombre + " se validó correctamente");
                                } else {

                                    System.out.println("No se pudo cargar el contacto " + nombre + ". Error al validar los datos.");
                                    cadena = b.readLine();
                                    continue;

                                }

                            ((Contacto) ent).setNombre(nombre);
                            ((Contacto) ent).setApellido(apellido);
                            ((Contacto) ent).setAlias(alias);
                            ((Contacto) ent).setDireccion(direccion);
                            ((Contacto) ent).setTelefono(telefono);
                            ((Contacto) ent).setNacimiento(nacimiento);
                            ((Contacto) ent).setEmail(email);
                            ((Contacto) ent).setFavorito("no");
                            // System.out.println(ent.toString());
                            //  ed.agregarEntidad(ent);
                            cadena = b.readLine();

                            } else {

                                System.out.println("No se pudo cargar el contacto " + nombre + ". Faltan datos.");
                                cadena = b.readLine();
                            }

                        }
                    }
                    b.close();
                    //      moverFichero();
                } // aca termina el for que lee cada archivo txt.

            } catch (FileNotFoundException e) {

                System.out.println("No se encontró el fichero");

            } catch (Exception e) {

                System.out.println("Ocurrió un error al leer o cerrar el fichero" + e);

            }

        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            System.out.println("Error al ejecutar thread");
        }
    }
}

java csv

asked by Ana 14.12.2018 в 06:03

source

2 answers

Error in build.gradle "should not contain a package statement Delete a record from a table and this is automatically removed in Angular

score 1 · Answer 1

Well with this I answer your question of how to search in a directory which is a file of a certain type, in this case I created a constant variable in which I only put .txt since it was what required.

On the other hand say that I have created a class with the specific function of looking for this extension, so that it is applicable to any other point of the application and only instantiating an object of this class can have it thus encapsulating the code completely. builder parameters.

Even if you would like to search for several types of extension at the same time, you could create objects of the class by changing the extension to search.

import java.io.BufferedReader;
import java.io.File;
import java.io.FileReader;
import java.io.FilenameFilter;
import java.util.logging.Level;
import java.util.logging.Logger;

class LectorArchivos extends Thread {

    private final String EXTENSION_A_BUSCAR = ".txt";
    private final int DELAY = 1500;

    private String directorioCarpetaAExaminar = "";
    private boolean encendido = true;

    public LectorArchivos(String dir) {
        directorioCarpetaAExaminar = dir;
    }



    @Override
    public void run() {

        File dir = new File(directorioCarpetaAExaminar);
        while (encendido) {

            File[] files = dir.listFiles(new FilenameFilter() {
                public boolean accept(File dir, String name) {
                    return name.toLowerCase().endsWith(".txt");
                }
            });

            for (int i = 0; i < files.length; i++) {
                File file = files[i];
                System.out.println(file);
            }
            if (files.length == 0) {
                System.out.println("El directorio no contiene extensiones de tipo '"+EXTENSION_A_BUSCAR+"'");
            }


            System.out.println("--");

            try {
                sleep(DELAY);
            } catch (InterruptedException ex) {
                Logger.getLogger(LectorArchivos.class.getName()).log(Level.SEVERE, null, ex);
            }
        }
    }

    public void setDirectorioABuscar(String dir) {

        this.directorioCarpetaAExaminar = dir;
    }

    public void apagar_encender_thread() {
        this.encendido = !encendido;
    }

}

public class ThreadLectorArchivos {

    /**
     * @param args the command line arguments
     */
    public static void main(String[] args) {
        LectorArchivos la = new LectorArchivos("tuDirectorio");
        la.run();
    }

}

score 0 · Answer 2

there is another method to implement

thread that has to constantly look at a folder «My Documents»

this by using java.nio.file.WatchService , to know more about this functionality read this tutorial Oracle this allows us in a more efficient way to register a "File Watcher" that allows us to run a loop waiting for changes to a particular Folder. but this only works for files that modify or add if the folder already has files that need to be processed we must use java.nio.file.DirectoryStream here more about the DirectoryStream

to show a demo do not implement the database part since I do not have the details of these classes but if I implement the part of inspecting the folders and read the .txt files in Csv format.

import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.FileSystems;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.PathMatcher;
import java.nio.file.Paths;
import static java.nio.file.StandardWatchEventKinds.ENTRY_MODIFY;
//import static java.nio.file.StandardWatchEventKinds.ENTRY_CREATE;
import java.nio.file.WatchEvent;
import java.nio.file.WatchKey;
import java.nio.file.WatchService;
import java.util.HashMap;
import java.util.LinkedList;
import java.util.Optional;

public class Watcher {

    private final static PathMatcher TXTMATCHER = FileSystems.getDefault().getPathMatcher("glob:*.txt");
    private final WatchService watcher;
    private final HashMap<WatchKey, Path> keys;
    private boolean trace = false;

    @SuppressWarnings("unchecked")
    static <T> WatchEvent<T> cast(WatchEvent<?> event) {
        return (WatchEvent<T>) event;
    }

    public Watcher(Path dir) throws IOException {
        this.watcher = FileSystems.getDefault().newWatchService();
        this.keys = new HashMap<>();
        register(dir);
        this.trace = true;
    }

    /**
     * Register the given directory with the WatchService
     */
    private void register(Path dir) throws IOException {
        WatchKey key = dir.register(watcher, ENTRY_MODIFY);//o ENTRY_CREATE
        if (trace) {
            Path prev = keys.get(key);
            if (prev == null) {
                System.out.format("register: %s\n", dir);
            } else {
                if (!dir.equals(prev)) {
                    System.out.format("update: %s -> %s\n", prev, dir);
                }
            }
        }
        keys.put(key, dir);
    }

    /**
     * Process all events for keys queued to the watcher
     */
    void processEvents() {
        while (true) {
            //esperar a un evento de "archivo creado, modificado"
            WatchKey key;
            try {
                key = watcher.take();
            } catch (InterruptedException x) {
                return;
            }

            Path dir = keys.get(key);
            if (dir == null) {
                System.err.println("Folder no Reconocido!!");
                continue;
            }

            key.pollEvents().forEach((event) -> {
                WatchEvent.Kind kind = event.kind();
                WatchEvent<Path> ev = cast(event);
                Path name = ev.context();
                Path child = dir.resolve(name);
                System.out.format("%s: %s\n", event.kind().name(), child);
                if (TXTMATCHER.matches(name)) {
                    procesartxt(child);
                }
            });

            // reset la llave de watcher a manera de asegurar si aun es necesario esperar eventos dentro del folder
            boolean valid = key.reset();
            if (!valid) {
                keys.remove(key);
                if (keys.isEmpty()) {
                    // si ya no hay folders a escuchar salir del loop/thread
                    break;
                }
            }
        }
    }

    public static void main(String[] args) throws IOException {
        //leer los archivos que YA estan en el folder
        Path dir = Paths.get("C:\Desktop\test");
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(dir, "*.txt")) {
            stream.forEach(txtpath -> {
                procesartxt(txtpath);
            });
        }
        //Logear un visor espera de que se "edite o agregen datos al folder"
        new Watcher(dir).processEvents();
    }

    private static void procesartxt(Path child) {
        CsvParser parser = new CsvParser(child, Optional.empty());
        LinkedList<LinkedList<String>> csvdata = parser.Parse();
        if (csvdata != null) {
            csvdata.forEach(a -> {
                a.forEach(b -> {
                    System.out.printf("\"%s\",", b);
                });
                System.out.println();
            });
        }
    }
}

then use a version that is slightly modified from this class to read the CSV

import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.Charset;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedList;
import java.util.Objects;
import java.util.Optional;
import java.util.logging.Level;
import java.util.logging.Logger;

/**
 * CvsParser. a CSV File parser compliance with
 * <a href="https://tools.ietf.org/html/rfc4180">RFC4180</a>
 *
 * @see <a href="https://tools.ietf.org/html/rfc4180">RFC4180</a>
 * @author Eduardo
 */
public class CsvParser {

    /**
     * The CSV Separator as defined by RFC4180
     */
    public static final char SEPARATOR = (char) 0x2C;
    /**
     * Line limiter(s) for File as per definition on RFC4180 RFC4180 DEFINES
     * that the rows NEEDS to be separated by CRLF HOWEVER this is not followed
     * by all implementations some might use just CR others might only use LF,
     * we will warn if only one is present but will try our best to use either.
     */
    public static final char CR = '\r', LF = '\n';
    /**
     * the escape character as define in RFC4180
     */
    public static final char ESCAPE_CHR = '"';
    /**
     * Path to Read/write the Files.
     */
    private final Path ReadPath;
    /**
     * Character Set Used to Decode the File Characters from Raw Bytes this is
     * required to ensure the right Character is printed or "understood" by Java
     * UTF-16 characters if none is provided we will use the Java Default set on
     * this VM (likely "UTF-8" or "UTF-16")
     */
    private final Charset FileCharset;

    /**
     * Initialize the CvsParser instance using the
     *
     * @param Read Path to the csv file to be read
     * @param FileCharset the Optional Parameter to set for the Character Set
     * used to decode the file into Java Characters
     */
    public CsvParser(final Path Read, Optional<Charset> FileCharset) {
        //check non null 
        this.ReadPath = Objects.requireNonNull(Read, "Read Path is Null! please provide a valid Path");
        //check we can read from Read File 
        if (!Files.isReadable(ReadPath)) {
            Logger.getLogger(CsvParser.class.getName()).log(Level.WARNING, "Warning File does not exist, Permissions are missing, or is not Readable ");
        }
        if (FileCharset == null) {
            //wtf! this is just lazy! 
            FileCharset = Optional.empty();
        }
        this.FileCharset = FileCharset.orElse(Charset.defaultCharset());
    }

    /**
     * Parse the Text File using CSV format into a grid(a LinkedList of rows)
     * the LinkedList is of "rows"(a grid), each row is represented by a
     * LinkedList of Strings
     *
     * @return a grid(LinkedList of Rows)
     */
    public LinkedList<LinkedList<String>> Parse() {
        LinkedList<LinkedList<String>> grid = null;
        //try with resourse 
        try (BufferedReader br = Files.newBufferedReader(ReadPath, FileCharset)) {
            grid = new LinkedList<>();
            StringBuilder buffer = new StringBuilder();
            boolean escaped = false;
            int read;
            while ((read = br.read()) >= 0) {
                char readchr = (char) read;
                //lets see what kind if Character it is. 
                switch (readchr) {
                    case ESCAPE_CHR:
                        escaped = parseEscape(escaped, buffer, br, readchr);
                        break;
                    case SEPARATOR:
                        parseSeparator(escaped, grid, buffer, readchr);
                        break;
                    /*here we check for the "row" separator... 
                      RFC4180 DEFINES that the Row separator SHOULD have CR followed by LF
                      however there are CSV that use only one of them either CR or LF
                      and some are formal and use both. 
                      for better support lets asume we have a file that is a colash of separators 
                      HOWEVER we will warn about this on the logs
                     */
                    case CR:
                    case LF:
                        ParseLine(escaped, grid, buffer, br, readchr);
                        break;
                    //for any other character just feed it into the buffer.
                    default:
                        buffer.append(readchr);
                        break;
                }

            }
            //while finish, however there might be uncommited data
            //on the buffer that needs to go into the csvdata 
            if (buffer.length() > 0) {
                commitBuffer(grid, buffer);
                buffer = null;
            }
        } catch (IOException ex) {
            Logger.getLogger(CsvParser.class.getName()).log(Level.SEVERE, "Error Reading The CSV File.", ex);
        }
        //try with resource will close the Stream on its own. no need for finally statement. 
        return grid;
    }

    /**
     * this method is to be called when a Escape is detected, this method will
     * review if the escape is set. if it is set it will then check if the
     * escape character is on itself "escape" so it should be handled as literal
     * and just be added into the buffer (and ignore the duplicate value as it
     * is intended to escape.) as per
     * <a href="https://tools.ietf.org/html/rfc4180">RFC4180</a>
     * definition.
     *
     * @param escape the status of the Escape flag, if set to true will check if
     * the escape character is followed by another escape character and handle
     * it as a literal adding it to the buffer. otherwise setting and returning
     * the flag as false;<\br>
     * if the flag status is False it will set as true until another escape is
     * detected.
     * @param buffer the buffer to add the Characters into. if a flag is to be
     * handled as literal it will be appended to this buffer.
     * @param br the bufferedReader to read Characters from needed to read the
     * next character to determine if is a end escaping or to be handle as
     * literal
     * @param readchr the read character (a escape character can be inferred)
     * @return the Flag new status This should be used as escape is passed as
     * value not as a reference.
     * @throws IOException if an Exception is detected upon read the buffer.
     */
    private boolean parseEscape(boolean escape, StringBuilder buffer,
            BufferedReader br, char readchr) throws IOException {
        //it is a escape Charater, if we are Alredy escaping lets
        //handle if it is a quotation 
        if (escape) {
            //lets set a bookmark to return if we need to undo. 
            br.mark(2);
            char nextchar = (char) br.read();
            if (nextchar == readchr) {
                //this is a quote character on the field. so its not to finish the "escaping"
                //so we merge this 2 character to a single "double quote"
                buffer.append(readchr);
            } else {
                //this character is not a 2DQUOTE thefore this ends the escaping
                escape = false;
                //lets go back to the bookmark we did as we need to process that nextchar on the next loop
                br.reset();
            }
        } else {
            //we are not yet escaping, so since we found a ESCAPECHAR lets begin
            escape = true;
        }
        return escape;
    }

    /**
     * Commits the Data from the StringBuilder buffer into the Grid. this is
     * done in the following manner if the grid is empty (no rows on the grid) a
     * new Row(LinkedList<String>) will be added. next it will take the Last row
     * from the Grid and add the buffer data into the Row. next will set the
     * buffer length to 0 (to clean the buffer in a computation efficient
     * manner)
     *
     * @param grid a LinkedList<LinkedList<String>> that represents the Grid of
     * data.
     * @param buff a StringBuilder that holds the buffer (read data from the
     * file to be committed into the grid)
     */
    private void commitBuffer(LinkedList<LinkedList<String>> grid, StringBuilder buff) {
        if (grid.isEmpty()) {
            //if the grid is empty we need to add the first row and start filling its content 
            grid.add(new LinkedList<>());
        }
        //get the "last known row" and add a Cell(the String data) to the end of the Row
        grid.getLast().add(buff.toString());
        //for perforance, lets "clean the lenght of the buffer"
        //this is not "secure", but is quite efficient 
        buff.setLength(0);
        //if the data needs to be nulled "secure" but slow use this line
        //buff.trimToSize(); //clear the underline Array
    }

    /**
     * this method is to be called when a Separator is detected, this method
     * will review if the escape is set. if it is set it will add the separator
     * as a character into the buffer as per defined on the standard
     * <a href="https://tools.ietf.org/html/rfc4180">RFC4180</a>
     * definition. otherwise it will mean that the Cell is completed and
     * therefore needs to commit the Buffer into the grid. by calling
     * commitBuffer(grid, buffer);
     *
     * @param escape the status of the Escape flag, if true will add the
     * character into the buffer otherwise commit the data.
     * @param grid a LinkedList<LinkedList<String>> that represents the Grid of
     * data.
     * @param buff a StringBuilder that holds the buffer (read data from the
     * file to be committed into the grid)
     * @param readchr the character read. can be inferred it is a SEPARATOR
     */
    private void parseSeparator(boolean escape, LinkedList<LinkedList<String>> grid, StringBuilder buffer, char readchr) {
        //its a SEPARATOR HOWEVER is it escaped?
        if (escape) {
            //if escaped just add into the buffer
            buffer.append(readchr);
        } else {
            //not escaped so this is a SEPARATOR then lets flush the text buffer
            //into the Row data as a new "cell" 
            commitBuffer(grid, buffer);
        }
    }

    /**
     *
     * @param escape the status of the Escape flag, if true will add the
     * character into the buffer otherwise commit the data. and add a new Row to
     * be filled
     * @param grid a LinkedList<LinkedList<String>> that represents the Grid of
     * data.
     * @param buff a StringBuilder that holds the buffer (read data from the
     * file to be committed into the grid)
     * @param br the bufferedReader to read Characters from needed to read the
     * next character to determine and consume the next character if the Line is
     * correctly terminated.
     * @param readchr the Read character from BufferedReader, this can be either
     * if the characters that consist of a Line limiter (CR or LF)
     * @throws IOException if an Exception is detected upon read the buffer.
     */
    private void ParseLine(boolean escape, LinkedList<LinkedList<String>> grid,
            StringBuilder buffer, BufferedReader br, char readchr) throws IOException {
        //its a Carriage Return or a Line Feed
        if (escape) {
            //if escaped just add into the buffer
            buffer.append(readchr);
        } else {
            //not escaped so this should be CR or LF (or CR followed by LF)
            if (readchr == CR) {
                br.mark(2);
                char nextchar = (char) br.read();
                if (nextchar != LF) {
                    Logger.getLogger(CsvParser.class.getName()).log(Level.WARNING,
                            "WARNING there is a Carriage Return but is not Followed by Line Feed!");
                    br.reset();
                }
            } else {
                Logger.getLogger(CsvParser.class.getName()).log(Level.WARNING,
                        "WARNING there is a Line Feed but is not Preceded by Carriage Return!");
            }
            commitBuffer(grid, buffer);
            grid.add(new LinkedList<>());
        }
    }
}